1
|
Young LE, Nan Y, Jang E, Stevens R. Digital Epidemiological Approaches in HIV Research: a Scoping Methodological Review. Curr HIV/AIDS Rep 2023; 20:470-480. [PMID: 37917386 PMCID: PMC10719139 DOI: 10.1007/s11904-023-00673-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/11/2023] [Indexed: 11/04/2023]
Abstract
PURPOSE OF REVIEW The purpose of this scoping review was to summarize literature regarding the use of user-generated digital data collected for non-epidemiological purposes in human immunodeficiency virus (HIV) research. RECENT FINDINGS Thirty-nine papers were included in the final review. Four types of digital data were used: social media data, web search queries, mobile phone data, and data from global positioning system (GPS) devices. With these data, four HIV epidemiological objectives were pursued, including disease surveillance, behavioral surveillance, assessment of public attention to HIV, and characterization of risk contexts. Approximately one-third used machine learning for classification, prediction, or topic modeling. Less than a quarter discussed the ethics of using user-generated data for epidemiological purposes. User-generated digital data can be used to monitor, predict, and contextualize HIV risk and can help disrupt trajectories of risk closer to onset. However, more attention needs to be paid to digital ethics and the direction of the field in a post-Application Programming Interface (API) world.
Collapse
Affiliation(s)
- Lindsay E Young
- Annenberg School for Communication and Journalism, University of Southern California, 3502 Watt Way, Los Angeles, CA, 90089, USA.
| | - Yuanfeixue Nan
- Annenberg School for Communication and Journalism, University of Southern California, 3502 Watt Way, Los Angeles, CA, 90089, USA
| | - Eugene Jang
- Annenberg School for Communication and Journalism, University of Southern California, 3502 Watt Way, Los Angeles, CA, 90089, USA
| | - Robin Stevens
- Annenberg School for Communication and Journalism, University of Southern California, 3502 Watt Way, Los Angeles, CA, 90089, USA
| |
Collapse
|
2
|
Dolan E, Goulding J, Marshall H, Smith G, Long G, Tata LJ. Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models. Nat Commun 2023; 14:7258. [PMID: 37990023 PMCID: PMC10663456 DOI: 10.1038/s41467-023-42776-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 10/20/2023] [Indexed: 11/23/2023] Open
Abstract
The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for managing respiratory symptoms, to improve forecasting of weekly registered deaths from respiratory disease at local levels across England, by using over 2 billion transactions logged by a UK high street retailer from March 2016 to March 2020. We report the results from the novel AI (Artificial Intelligence) explainability variable importance tool Model Class Reliance implemented on the PADRUS model (Prediction of Amount of Deaths by Respiratory disease Using Sales). PADRUS is a machine learning model optimised to predict registered deaths from respiratory disease in 314 local authority areas across England through the integration of shopping sales data and focused on purchases of non-prescription medications. We found strong evidence that models incorporating sales data significantly out-perform other models that solely use variables traditionally associated with respiratory disease (e.g. sociodemographics and weather data). Accuracy gains are highest (increases in R2 (coefficient of determination) between 0.09 to 0.11) in periods of maximum risk to the general public. Results demonstrate the potential to utilise sales data to monitor population health with information at a high level of geographic granularity.
Collapse
Affiliation(s)
- Elizabeth Dolan
- N/LAB, Nottingham University Business School, University of Nottingham, Nottingham, UK.
- Horizon Centre for Doctoral Training, University of Nottingham, Nottingham, UK.
| | - James Goulding
- N/LAB, Nottingham University Business School, University of Nottingham, Nottingham, UK
| | - Harry Marshall
- N/LAB, Nottingham University Business School, University of Nottingham, Nottingham, UK
| | - Gavin Smith
- N/LAB, Nottingham University Business School, University of Nottingham, Nottingham, UK
| | - Gavin Long
- N/LAB, Nottingham University Business School, University of Nottingham, Nottingham, UK
| | - Laila J Tata
- Lifespan and Population Health, School of Medicine, University of Nottingham, Nottingham, UK
| |
Collapse
|
3
|
Ferrara M, Gentili E, Belvederi Murri M, Zese R, Alberti M, Franchini G, Domenicano I, Folesani F, Sorio C, Benini L, Carozza P, Little J, Grassi L. Establishment of a Public Mental Health Database for Research Purposes in the Ferrara Province: Development and Preliminary Evaluation Study. JMIR Med Inform 2023; 11:e45523. [PMID: 37584563 PMCID: PMC10461404 DOI: 10.2196/45523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 05/04/2023] [Accepted: 06/01/2023] [Indexed: 08/17/2023] Open
Abstract
Background The immediate use of data exported from electronic health records (EHRs) for research is often limited by the necessity to transform data elements into an actual data set. Objective This paper describes the methodology for establishing a data set that originated from an EHR registry that included clinical, health service, and sociodemographic information. Methods The Extract, Transform, Load process was applied to raw data collected at the Integrated Department of Mental Health and Pathological Addictions in Ferrara, Italy, from 1925 to February 18, 2021, to build the new, anonymized Ferrara-Psychiatry (FEPSY) database. Information collected before the first EHR was implemented (ie, in 1991) was excluded. An unsupervised cluster analysis was performed to identify patient subgroups to support the proof of concept. Results The FEPSY database included 3,861,432 records on 46,222 patients. Since 1991, each year, a median of 1404 (IQR 1117.5-1757.7) patients had newly accessed care, and a median of 7300 (IQR 6109.5-9397.5) patients were actively receiving care. Among 38,022 patients with a mental disorder, 2 clusters were identified; the first predominantly included male patients who were aged 25 to 34 years at first presentation and were living with their parents, and the second predominantly included female patients who were aged 35 to 44 years and were living with their own families. Conclusions The process for building the FEPSY database proved to be robust and replicable with similar health care data, even when they were not originally conceived for research purposes. The FEPSY database will enable future in-depth analyses regarding the epidemiology and social determinants of mental disorders, access to mental health care, and resource utilization.
Collapse
Affiliation(s)
- Maria Ferrara
- Institute of Psychiatry, Department of Neuroscience and Rehabilitation, University of Ferrara, Ferrara, Italy
- Integrated Department of Mental Health and Pathological Addictions, Ferrara Local Health Trust, Ferrara, Italy
- Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
| | | | - Martino Belvederi Murri
- Institute of Psychiatry, Department of Neuroscience and Rehabilitation, University of Ferrara, Ferrara, Italy
- Integrated Department of Mental Health and Pathological Addictions, Ferrara Local Health Trust, Ferrara, Italy
| | - Riccardo Zese
- Department of Chemical, Pharmaceutical and Agricultural Sciences, University of Ferrara, Ferrara, Italy
| | - Marco Alberti
- Department of Mathematics and Computer Science, University of Ferrara, Ferrara, Italy
| | - Giorgia Franchini
- Department of Physics, Informatics and Mathematics, University of Modena and Reggio Emilia, Modena, Italy
| | - Ilaria Domenicano
- Institute of Psychiatry, Department of Neuroscience and Rehabilitation, University of Ferrara, Ferrara, Italy
| | - Federica Folesani
- Institute of Psychiatry, Department of Neuroscience and Rehabilitation, University of Ferrara, Ferrara, Italy
- Integrated Department of Mental Health and Pathological Addictions, Ferrara Local Health Trust, Ferrara, Italy
| | - Cristina Sorio
- Integrated Department of Mental Health and Pathological Addictions, Ferrara Local Health Trust, Ferrara, Italy
| | - Lorenzo Benini
- Integrated Department of Mental Health and Pathological Addictions, Ferrara Local Health Trust, Ferrara, Italy
| | - Paola Carozza
- Integrated Department of Mental Health and Pathological Addictions, Ferrara Local Health Trust, Ferrara, Italy
| | - Julian Little
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada
| | - Luigi Grassi
- Institute of Psychiatry, Department of Neuroscience and Rehabilitation, University of Ferrara, Ferrara, Italy
- Integrated Department of Mental Health and Pathological Addictions, Ferrara Local Health Trust, Ferrara, Italy
| |
Collapse
|
4
|
Yang J, Zhou J, Luo T, Xie Y, Wei Y, Mai H, Yang Y, Cui P, Ye L, Liang H, Huang J. Predicting pulmonary tuberculosis incidence in China using Baidu search index: an ARIMAX model approach. Environ Health Prev Med 2023; 28:68. [PMID: 37926526 PMCID: PMC10636285 DOI: 10.1265/ehpm.23-00141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 09/30/2023] [Indexed: 11/07/2023] Open
Abstract
BACKGROUND Existing researches have established a correlation between internet search data and the epidemics of numerous infectious diseases. This study aims to develop a prediction model to explore the relationship between the Pulmonary Tuberculosis (PTB) epidemic trend in China and the Baidu search index. METHODS Collect the number of new cases of PTB in China from January 2011 to August 2022. Use Spearman rank correlation and interaction analysis to identify Baidu keywords related to PTB and construct a PTB comprehensive search index. Evaluate the predictive performance of autoregressive integrated moving average (ARIMA) and ARIMA with explanatory variable (ARIMAX) models for the number of PTB cases. RESULTS Incidence of PTB had shown a fluctuating downward trend. The Spearman rank correlation coefficient between the PTB comprehensive search index and its incidence was 0.834 (P < 0.001). The ARIMA model had an AIC value of 2804.41, and the MAPE value was 13.19%. The ARIMAX model incorporating the Baidu index demonstrated an AIC value of 2761.58 and a MAPE value of 5.33%. CONCLUSIONS The ARIMAX model is superior to ARIMA in terms of fitting and predicting accuracy. Additionally, the use of Baidu Index has proven to be effective in predicting cases of PTB.
Collapse
Affiliation(s)
- Jing Yang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Jie Zhou
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Tingyan Luo
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Yulan Xie
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Yiru Wei
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Huanzhuo Mai
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Yuecong Yang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Ping Cui
- Life Science Institute, Guangxi Medical University, Nanning, China
| | - Li Ye
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| | - Hao Liang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- Life Science Institute, Guangxi Medical University, Nanning, China
| | - Jiegang Huang
- Guangxi Key Laboratory of AIDS Prevention and Treatment, Nanning, China
- Guangxi Colleges and Universities Key Laboratory of Prevention and Control of Highly Prevalent Diseases, Guangxi Medical University, Nanning, China
- School of Public Health, Guangxi Medical University, Nanning, China
| |
Collapse
|
5
|
Ravalli S, Roggio F, Lauretta G, Di Rosa M, D'Amico AG, D'agata V, Maugeri G, Musumeci G. Exploiting real-world data to monitor physical activity in patients with osteoarthritis: the opportunity of digital epidemiology. Heliyon 2022; 8:e08991. [PMID: 35252602 PMCID: PMC8889133 DOI: 10.1016/j.heliyon.2022.e08991] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 11/22/2021] [Accepted: 02/16/2022] [Indexed: 12/15/2022] Open
Abstract
Osteoarthritis is a degenerative joint disease that affects millions of people worldwide. Current guidelines emphasize the importance of regular physical activity as a preventive measure against disease progression and as a valuable strategy for pain and functionality management. Despite this, most patients with osteoarthritis are inactive. Modern technological advances have led to the implementation of digital devices, such as wearables and smartphones, showing new opportunities for healthcare professionals and researchers to monitor physical activity and therefore engage patients in daily exercising. Additionally, digital devices have emerged as a promising tool for improving frequent health data collection, disease monitoring, and supporting public health surveillance. The leveraging of digital data has laid the foundation for developing a new concept of epidemiological study, known as "Digital Epidemiology". Analyzing real-world data can change the way we observe human behavior and suggest health interventions, as in the case of physical exercise and osteoarthritic patients. Furthermore, large-scale data could contribute to personalized and precision medicine in the future. Herein, an overview of recent clinical applications of wearables for monitoring physical activity in patients with osteoarthritis and the benefits of exploiting real-world data in the context of digital epidemiology are discussed.
Collapse
Affiliation(s)
- Silvia Ravalli
- Department of Biomedical and Biotechnological Sciences, Human Anatomy and Histology Section, School of Medicine, University of Catania, Via S. Sofia 87, 95123 Catania, Italy
| | - Federico Roggio
- Department of Biomedical and Biotechnological Sciences, Human Anatomy and Histology Section, School of Medicine, University of Catania, Via S. Sofia 87, 95123 Catania, Italy.,Department of Psychology, Educational Science and Human Movement, University of Palermo, Via Giovanni Pascoli 6, 90144 Palermo, Italy
| | - Giovanni Lauretta
- Department of Biomedical and Biotechnological Sciences, Human Anatomy and Histology Section, School of Medicine, University of Catania, Via S. Sofia 87, 95123 Catania, Italy
| | - Michelino Di Rosa
- Department of Biomedical and Biotechnological Sciences, Human Anatomy and Histology Section, School of Medicine, University of Catania, Via S. Sofia 87, 95123 Catania, Italy
| | - Agata Grazia D'Amico
- Department of Drug and Health Sciences, University of Catania, 95125 Catania, Italy
| | - Velia D'agata
- Department of Biomedical and Biotechnological Sciences, Human Anatomy and Histology Section, School of Medicine, University of Catania, Via S. Sofia 87, 95123 Catania, Italy
| | - Grazia Maugeri
- Department of Biomedical and Biotechnological Sciences, Human Anatomy and Histology Section, School of Medicine, University of Catania, Via S. Sofia 87, 95123 Catania, Italy
| | - Giuseppe Musumeci
- Department of Biomedical and Biotechnological Sciences, Human Anatomy and Histology Section, School of Medicine, University of Catania, Via S. Sofia 87, 95123 Catania, Italy.,Research Center on Motor Activities (CRAM), University of Catania, 95123 Catania, Italy.,Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA 19122, USA
| |
Collapse
|
6
|
Sahu KS, Majowicz SE, Dubin JA, Morita PP. NextGen Public Health Surveillance and the Internet of Things (IoT). Front Public Health 2021; 9:756675. [PMID: 34926381 PMCID: PMC8678116 DOI: 10.3389/fpubh.2021.756675] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 11/12/2021] [Indexed: 11/23/2022] Open
Abstract
Recent advances in technology have led to the rise of new-age data sources (e.g., Internet of Things (IoT), wearables, social media, and mobile health). IoT is becoming ubiquitous, and data generation is accelerating globally. Other health research domains have used IoT as a data source, but its potential has not been thoroughly explored and utilized systematically in public health surveillance. This article summarizes the existing literature on the use of IoT as a data source for surveillance. It presents the shortcomings of current data sources and how NextGen data sources, including the large-scale applications of IoT, can meet the needs of surveillance. The opportunities and challenges of using these modern data sources in public health surveillance are also explored. These IoT data ecosystems are being generated with minimal effort by the device users and benefit from high granularity, objectivity, and validity. Advances in computing are now bringing IoT-based surveillance into the realm of possibility. The potential advantages of IoT data include high-frequency, high volume, zero effort data collection methods, with a potential to have syndromic surveillance. In contrast, the critical challenges to mainstream this data source within surveillance systems are the huge volume and variety of data, fusing data from multiple devices to produce a unified result, and the lack of multidisciplinary professionals to understand the domain and analyze the domain data accordingly.
Collapse
Affiliation(s)
- Kirti Sundar Sahu
- School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada
| | - Shannon E. Majowicz
- School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada
| | - Joel A. Dubin
- School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada
- Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON, Canada
| | - Plinio Pelegrini Morita
- School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada
- Institute of Health Policy, Management, and Evaluation, University of Toronto, Toronto, ON, Canada
- Department of Systems Design Engineering, University of Waterloo, Waterloo, ON, Canada
- Ehealth Innovation, Techna Institute, University Health Network, Toronto, ON, Canada
- Research Institute for Aging, University of Waterloo, Waterloo, ON, Canada
| |
Collapse
|
7
|
Chong M, Park HW. COVID-19 in the Twitterverse, from epidemic to pandemic: information-sharing behavior and Twitter as an information carrier. Scientometrics 2021; 126:6479-6503. [PMID: 34188332 PMCID: PMC8221743 DOI: 10.1007/s11192-021-04054-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 05/19/2021] [Indexed: 12/03/2022]
Abstract
In this study, we defined a Twitter network as an information channel that includes information sources containing embedded messages. We conducted stage-based comparative analyses of Twitter networks during three periods: the beginning of the COVID-19 epidemic, the period when the epidemic was becoming a global phenomenon, and the beginning of the pandemic. We also analyzed the characteristics of scientific information sources and content on Twitter during the sample period. At the beginning of the epidemic, Twitter users largely shared trustworthy news information sources about the novel coronavirus. Widely shared scientific information focused on clinical investigations and case studies of the new coronavirus as the disease became a pandemic while non-scientific information sources and messages illustrated the social and political aspects of the global outbreak, often including emotional elements. Multiple suspicious, bot-like Twitter accounts were identified as a great connector of the COVID-19 Twitterverse, particularly in the beginning of the global crisis. Our findings suggest that the information carriers, which are information channels, sources, and messages were coherently interlocked, forming an information organism. The study results can help public health organizations design communication strategies, which often require prompt decision-making to manage urgent needs under the circumstances of an epidemic.
Collapse
Affiliation(s)
- Miyoung Chong
- Deliberative Media Lab, University of Virginia, 1605 Jefferson Park Ave., Charlottesville, VA 22904 USA
| | - Han Woo Park
- Department of Media & Communication, Interdisciplinary Graduate Programs of Digital Convergence Business and East Asian Cultural Studies, Founder of the Cyber Emotions Research Institute, YeungNam University, 280 Daehak-Ro, Gyeongsangbuk-do Gyeongsan-si, 38541 South Korea
| |
Collapse
|
8
|
Yasseen AS, Kwong JC, Kustra R, Holder L, Chung H, Macdonald L, Janjua NZ, Mazzulli T, Feld J, Crowcroft NS. Validating viral hepatitis B and C diagnosis codes: a retrospective analysis using Ontario's health administrative data. CANADIAN JOURNAL OF PUBLIC HEALTH = REVUE CANADIENNE DE SANTE PUBLIQUE 2021; 112:502-512. [PMID: 33417192 PMCID: PMC8076389 DOI: 10.17269/s41997-020-00435-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Accepted: 10/11/2020] [Indexed: 12/22/2022]
Abstract
OBJECTIVE We aimed to determine the criterion validity of using diagnosis codes for hepatitis B virus (HBV) and hepatitis C virus (HCV) to identify infections. METHODS Using linked laboratory and administrative data in Ontario, Canada, from January 2004 to December 2014, we validated HBV/HCV diagnosis codes against laboratory-confirmed infections. Performance measures (sensitivity, specificity, and positive predictive value) were estimated via cross-validated logistic regression and we explored variations by varying time windows from 1 to 5 years before (i.e., prognostic prediction) and after (i.e., diagnostic prediction) the date of laboratory confirmation. Subgroup analyses were performed among immigrants, males, baby boomers, and females to examine the robustness of these measures. RESULTS A total of 1,599,023 individuals were tested for HBV and 840,924 for HCV, with a resulting 41,714 (2.7%) and 58,563 (7.0%) infections identified, respectively. HBV/HCV diagnosis codes ± 3 years of laboratory confirmation showed high specificity (99.9% HBV; 99.8% HCV), moderate positive predictive value (70.3% HBV; 85.8% HCV), and low sensitivity (12.8% HBV; 30.8% HCV). Varying the time window resulted in limited changes to performance measures. Diagnostic models consistently outperformed prognostic models. No major differences were observed among subgroups. CONCLUSION HBV/HCV codes should not be the only source used for monitoring the population burden of these infections, due to low sensitivity and moderate positive predictive values. These results underscore the importance of ongoing laboratory and reportable disease surveillance systems for monitoring viral hepatitis in Ontario.
Collapse
Affiliation(s)
- Abdool S Yasseen
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
- Public Health Ontario, Toronto, Canada
- ICES, Toronto, Canada
| | - Jeffrey C Kwong
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
- Public Health Ontario, Toronto, Canada
- ICES, Toronto, Canada
- University Health Network, Toronto, Canada
- Department of Family and Community Medicine, University of Toronto, Toronto, Canada
| | - Rafal Kustra
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
| | | | | | - Liane Macdonald
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
- Public Health Ontario, Toronto, Canada
| | - Naveed Z Janjua
- Hepatitis Testers Cohort, British Columbia Centre for Diseases Control, Vancouver, Canada
| | - Tony Mazzulli
- Public Health Ontario, Toronto, Canada
- Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
- Department of Microbiology, Mount Sinai Hospital/University Health Network, Toronto, Canada
| | - Jordan Feld
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
- Public Health Ontario, Toronto, Canada
- ICES, Toronto, Canada
- University Health Network, Toronto, Canada
| | - Natasha S Crowcroft
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada.
- ICES, Toronto, Canada.
- Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada.
- Department of Microbiology, Mount Sinai Hospital/University Health Network, Toronto, Canada.
| |
Collapse
|
9
|
Abstract
The articles in this special issue of AIDS focus on the application of the so-called Big Data science (BDS) as applied to a variety of HIV-applied research questions in the sphere of health services and epidemiology. Recent advances in technology means that a critical mass of HIV-related health data with actionable intelligence is available for optimizing health outcomes, improving and informing surveillance. Data science will play a key but complementary role in supporting current efforts in prevention, diagnosis, treatment, and response needed to end the HIV epidemic. This collection provides a glimpse of the promise inherent in leveraging the digital age and improved methods in Big Data science to reimagine HIV treatment and prevention in a digital age.
Collapse
Affiliation(s)
- Bankole Olatosi
- Big Data Health Science Center, University of South Carolina, Columbia, SC 29208
- Department of Health Services Policy and Management, Arnold School of Public Health, University of South Carolina, Columbia, SC 29208
| | - Sten H. Vermund
- School of Public Health, Yale University, New Haven, CT 06510
| | - Xiaoming Li
- Big Data Health Science Center, University of South Carolina, Columbia, SC 29208
- Department of Health Promotion, Behavior and Education, Arnold School of Public Health, University of South Carolina, Columbia, SC 29208
| |
Collapse
|
10
|
Wang Y, Zhao H. Digital data-based strategies: A novel form of better understanding COVID-19 pandemic and international scientific collaboration. PLoS One 2021; 16:e0249280. [PMID: 33793613 PMCID: PMC8016224 DOI: 10.1371/journal.pone.0249280] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 03/15/2021] [Indexed: 12/13/2022] Open
Abstract
International scientific collaborations have always been regarded as critical actions to address global pandemics, however, there was an obvious uncertainty between international collaboration and the COVID-19 control. We aim to combine digital data-based strategies to produce meaningful and advanced insights into the imbalance between COVID-19 and international collaboration, as well as reveal possible influencing factors, and ultimately enhance global collaboration. We conducted three retrospective cohort studies using respectively COVID-19 data from WHO, a complete dataset of scientific publications on coronavirus-related research from WoS, and daily data from Google Trends (GT). The results of geovisualization and spatiotemporal analysis revealed that the global COVID19 pandemic still remains serious. The global issue of imbalance between international collaborations and pandemic does exit, and the nations with good pandemic control had their own characteristics in above-mentioned correlation. Digital epidemiology provides, at least in part, evidence-based assessment and scientific advice to understand the imbalance between international collaborations and COVID-19. Our investigation demonstrates that transdisciplinary conversation through digital data-based strategies can help us fully understand the complex factors influencing the effectiveness of international scientific collaboration, thus facilitating the global response to COVID-19.
Collapse
Affiliation(s)
- Yan Wang
- Scientific Research Center, The Second Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Henan Zhao
- Department of Pathophysiology, College of Basic Medical Sciences, Dalian Medical University, Dalian, Liaoning, China
| |
Collapse
|
11
|
Shakeri Hossein Abad Z, Kline A, Sultana M, Noaeen M, Nurmambetova E, Lucini F, Al-Jefri M, Lee J. Digital public health surveillance: a systematic scoping review. NPJ Digit Med 2021; 4:41. [PMID: 33658681 PMCID: PMC7930261 DOI: 10.1038/s41746-021-00407-6] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 01/21/2021] [Indexed: 02/06/2023] Open
Abstract
The ubiquitous and openly accessible information produced by the public on the Internet has sparked an increasing interest in developing digital public health surveillance (DPHS) systems. We conducted a systematic scoping review in accordance with the PRISMA extension for scoping reviews to consolidate and characterize the existing research on DPHS and identify areas for further research. We used Natural Language Processing and content analysis to define the search strings and searched Global Health, Web of Science, PubMed, and Google Scholar from 2005 to January 2020 for peer-reviewed articles on DPHS, with extensive hand searching. Seven hundred fifty-five articles were included in this review. The studies were from 54 countries and utilized 26 digital platforms to study 208 sub-categories of 49 categories associated with 16 public health surveillance (PHS) themes. Most studies were conducted by researchers from the United States (56%, 426) and dominated by communicable diseases-related topics (25%, 187), followed by behavioural risk factors (17%, 131). While this review discusses the potentials of using Internet-based data as an affordable and instantaneous resource for DPHS, it highlights the paucity of longitudinal studies and the methodological and inherent practical limitations underpinning the successful implementation of a DPHS system. Little work studied Internet users' demographics when developing DPHS systems, and 39% (291) of studies did not stratify their results by geographic region. A clear methodology by which the results of DPHS can be linked to public health action has yet to be established, as only six (0.8%) studies deployed their system into a PHS context.
Collapse
Affiliation(s)
- Zahra Shakeri Hossein Abad
- Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
| | - Adrienne Kline
- Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Madeena Sultana
- Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Mohammad Noaeen
- Department of Electrical and Computer Engineering, Schulich School of Engineering, University of Calgary, Calgary, AB, Canada
| | - Elvira Nurmambetova
- Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Filipe Lucini
- Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Critical Care Medicine, Cumming School of Medicine, University of Calgary and Alberta Health Services, Calgary, AB, Canada
| | - Majed Al-Jefri
- Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Joon Lee
- Data Intelligence for Health Lab, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Cardiac Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| |
Collapse
|
12
|
Muselli M, Cofini V, Desideri G, Necozione S. Coronavirus (Covid-19) pandemic: How may communication strategies influence our behaviours? INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION : IJDRR 2021; 53:101982. [PMID: 33251100 PMCID: PMC7683303 DOI: 10.1016/j.ijdrr.2020.101982] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Revised: 11/18/2020] [Accepted: 11/19/2020] [Indexed: 05/05/2023]
Abstract
A novel Corona virus (SARS-CoV-2), started in Wuhan China, caused an outbreak of viral pneumonia to subsequently spread throughout the world. Italy has been one of the most affected countries in the world and the increasing number of cases and deaths has created strong emotional reactions in people. This study has aimed at evaluating public attention to this emerging disease through the use of Google Trends. Public attention, measured as the volume of internet search activity, was correlated with Health Communication Strategies and official COVID-19 data. At the moment of the study analysis, Italy was by far the first country in terms of search volume for "coronavirus" and the highest peak of searches was reached on February 23, 2020. We have found that there was a correlation between public attention to coronavirus disease and communications from Public Health policies: we observed spikes in search volumes on the days of Presidential Decree publications. Furthermore, this attention was also correlated with Case Fatality Rate (CFR). Even if CFR data are continuously updated and can be affected by patient histories, the correlation found suggests that the increase in mortality has generated growing interest in the disease and its risk perception. This study shows that tracking searches through Google Trends as a public focus indicator is a useful tool for decision-makers in guiding communication strategies and should as well stimulate a more transparent media and policy making reporting.
Collapse
Affiliation(s)
- Mario Muselli
- Department of Life, Health and Environmental Science, University of L'Aquila, Viale S. Salvatore, Delta 6 Medicina, 67100, L'Aquila, Italy
| | - Vincenza Cofini
- Department of Life, Health and Environmental Science, University of L'Aquila, Viale S. Salvatore, Delta 6 Medicina, 67100, L'Aquila, Italy
| | - Giovambattista Desideri
- Department of Life, Health and Environmental Science, University of L'Aquila, Viale S. Salvatore, Delta 6 Medicina, 67100, L'Aquila, Italy
| | - Stefano Necozione
- Department of Life, Health and Environmental Science, University of L'Aquila, Viale S. Salvatore, Delta 6 Medicina, 67100, L'Aquila, Italy
| |
Collapse
|
13
|
Tarkoma S, Alghnam S, Howell MD. Fighting pandemics with digital epidemiology. EClinicalMedicine 2020; 26:100512. [PMID: 32864592 PMCID: PMC7446704 DOI: 10.1016/j.eclinm.2020.100512] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 07/31/2020] [Accepted: 07/31/2020] [Indexed: 12/16/2022] Open
Affiliation(s)
- Sasu Tarkoma
- University of Helsinki, Pietari Kalmin katu 5, 00014, Finland
- Corresponding author.
| | - Suliman Alghnam
- King Abdullah International Medical Research Center (KAIMRC), King Saud Bin Abdulaziz University for Health Sciences (KSAU-HS), Al-Sheikh Jaber Al-Sabah St., 11426 Riyadh, Saudi Arabia
| | | |
Collapse
|
14
|
Gupta A, Katarya R. Social media based surveillance systems for healthcare using machine learning: A systematic review. J Biomed Inform 2020; 108:103500. [PMID: 32622833 PMCID: PMC7331523 DOI: 10.1016/j.jbi.2020.103500] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Revised: 06/21/2020] [Accepted: 06/26/2020] [Indexed: 11/30/2022]
Abstract
BACKGROUND Real-time surveillance in the field of health informatics has emerged as a growing domain of interest among worldwide researchers. Evolution in this field has helped in the introduction of various initiatives related to public health informatics. Surveillance systems in the area of health informatics utilizing social media information have been developed for early prediction of disease outbreaks and to monitor diseases. In the past few years, the availability of social media data, particularly Twitter data, enabled real-time syndromic surveillance that provides immediate analysis and instant feedback to those who are charged with follow-ups and investigation of potential outbreaks. In this paper, we review the recent work, trends, and machine learning(ML) text classification approaches used by surveillance systems seeking social media data in the healthcare domain. We also highlight the limitations and challenges followed by possible future directions that can be taken further in this domain. METHODS To study the landscape of research in health informatics performing surveillance of the various health-related data posted on social media or web-based platforms, we present a bibliometric analysis of the 1240 publications indexed in multiple scientific databases (IEEE, ACM Digital Library, ScienceDirect, PubMed) from the year 2010-2018. The papers were further reviewed based on the various machine learning algorithms used for analyzing health-related text posted on social media platforms. FINDINGS Based on the corpus of 148 selected articles, the study finds the types of social media or web-based platforms used for surveillance in the healthcare domain, along with the health topic(s) studied by them. In the corpus of selected articles, we found 26 articles were using machine learning technique. These articles were studied to find commonly used ML techniques. The majority of studies (24%) focused on the surveillance of flu or influenza-like illness (ILI). Twitter (64%) is the most popular data source to perform surveillance research using social media text data, and Support Vector Machine (SVM) (33%) being the most used ML algorithm for text classification. CONCLUSIONS The inclusion of online data in surveillance systems has improved the disease prediction ability over traditional syndromic surveillance systems. However, social media based surveillance systems have many limitations and challenges, including noise, demographic bias, privacy issues, etc. Our paper mentions future directions, which can be useful for researchers working in the area. Researchers can use this paper as a library for social media based surveillance systems in the healthcare domain and can expand such systems by incorporating the future works discussed in our paper.
Collapse
|
15
|
Darmann-Finck I, Rothgang H, Zeeb H. [Digitalization and Health Sciences - White Paper Digital Public Health]. DAS GESUNDHEITSWESEN 2020; 82:620-622. [PMID: 32698204 PMCID: PMC7990570 DOI: 10.1055/a-1191-4344] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Potential limits and risk of digitalization in public health will be a core topic of the High profile Research Area Health Sciences at the University of Bremen in coming years. A white paper was developed to support positioning in this dynamic research. The paper describes the Health Sciences viewpoint on core evaluation criteria for digital public health, identifies interfaces and approaches for interdisciplinary cooperation and discusses cross-cutting themes as well as demarcations with respect to digitalization in medicine (digital health). An abbreviated version of the white paper is presented for discussion.
Collapse
Affiliation(s)
- Ingrid Darmann-Finck
- Institut für Public Health und Pflegewissenschaft, Universität Bremen, Bremen.,Wissenschaftsschwerpunkt Gesundheitswissenschaften, Universität Bremen, Bremen
| | - Heinz Rothgang
- SOCIUM Forschungszentrum Ungleichheit und Sozialpolitik, Universität Bremen, Bremen.,Wissenschaftsschwerpunkt Gesundheitswissenschaften, Universität Bremen, Bremen
| | - Hajo Zeeb
- Wissenschaftsschwerpunkt Gesundheitswissenschaften, Universität Bremen, Bremen.,Abt. Prävention und Evaluation, Leibniz-Institut für Präventionsforschung und Epidemiologie-BIPS, Bremen
| |
Collapse
|
16
|
COVID-19 and digital epidemiology. JOURNAL OF PUBLIC HEALTH-HEIDELBERG 2020; 30:245-247. [PMID: 32355606 PMCID: PMC7190458 DOI: 10.1007/s10389-020-01295-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Accepted: 04/22/2020] [Indexed: 12/21/2022]
|
17
|
Aiello AE, Renson A, Zivich PN. Social Media- and Internet-Based Disease Surveillance for Public Health. Annu Rev Public Health 2020; 41:101-118. [PMID: 31905322 DOI: 10.1146/annurev-publhealth-040119-094402] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Disease surveillance systems are a cornerstone of public health tracking and prevention. This review addresses the use, promise, perils, and ethics of social media- and Internet-based data collection for public health surveillance. Our review highlights untapped opportunities for integrating digital surveillance in public health and current applications that could be improved through better integration, validation, and clarity on rules surrounding ethical considerations. Promising developments include hybrid systems that couple traditional surveillance data with data from search queries, social media posts, and crowdsourcing. In the future, it will be important to identify opportunities for public and private partnerships, train public health experts in data science, reduce biases related to digital data (gathered from Internet use, wearable devices, etc.), and address privacy. We are on the precipice of an unprecedented opportunity to track, predict, and prevent global disease burdens in the population using digital data.
Collapse
Affiliation(s)
- Allison E Aiello
- Department of Epidemiology, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7435, USA; , ,
| | - Audrey Renson
- Department of Epidemiology, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7435, USA; , ,
| | - Paul N Zivich
- Department of Epidemiology, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7435, USA; , ,
| |
Collapse
|