1
|
Zhang J, Xu W, Lei C, Pu Y, Zhang Y, Zhang J, Yu H, Su X, Huang Y, Gong R, Zhang L, Shi Q. Using Clinician-Patient WeChat Group Communication Data to Identify Symptom Burdens in Patients With Uterine Fibroids Under Focused Ultrasound Ablation Surgery Treatment: Qualitative Study. JMIR Form Res 2023; 7:e43995. [PMID: 37656501 PMCID: PMC10504630 DOI: 10.2196/43995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 12/26/2022] [Accepted: 07/24/2023] [Indexed: 09/02/2023] Open
Abstract
BACKGROUND Unlike research project-based health data collection (questionnaires and interviews), social media platforms allow patients to freely discuss their health status and obtain peer support. Previous literature has pointed out that both public and private social platforms can serve as data sources for analysis. OBJECTIVE This study aimed to use natural language processing (NLP) techniques to identify concerns regarding the postoperative quality of life and symptom burdens in patients with uterine fibroids after focused ultrasound ablation surgery. METHODS Screenshots taken from clinician-patient WeChat groups were converted into free texts using image text recognition technology and used as the research object of this study. From 408 patients diagnosed with uterine fibroids in Chongqing Haifu Hospital between 2010 and 2020, we searched for symptom burdens in over 900,000 words of WeChat group chats. We first built a corpus of symptoms by manually coding 30% of the WeChat texts and then used regular expressions in Python to crawl symptom information from the remaining texts based on this corpus. We compared the results with a manual review (gold standard) of the same records. Finally, we analyzed the relationship between the population baseline data and conceptual symptoms; quantitative and qualitative results were examined. RESULTS A total of 408 patients with uterine fibroids were included in the study; 190,000 words of free text were obtained after data cleaning. The mean age of the patients was 39.94 (SD 6.81) years, and their mean BMI was 22.18 (SD 2.78) kg/m2. The median reporting times of the 7 major symptoms were 21, 26, 57, 2, 18, 30, and 49 days. Logistic regression models identified preoperative menstrual duration (odds ratio [OR] 1.14, 95% CI 5.86-6.37; P=.009), age of menophania (OR -1.02 , 95% CI 11.96-13.47; P=.03), and the number (OR 2.34, 95% CI 1.45-1.83; P=.04) and size of fibroids (OR 0.12, 95% CI 2.43-3.51; P=.04) as significant risk factors for postoperative symptoms. CONCLUSIONS Unstructured free texts from social media platforms extracted by NLP technology can be used for analysis. By extracting the conceptual information about patients' health-related quality of life, we can adopt personalized treatment for patients at different stages of recovery to improve their quality of life. Python-based text mining of free-text data can accurately extract symptom burden and save considerable time compared to manual review, maximizing the utility of the extant information in population-based electronic health records for comparative effectiveness research.
Collapse
Affiliation(s)
- Jiayuan Zhang
- State Key Laboratory of Ultrasound in Medicine and Engineering, College of Biomedical Engineering, Chongqing Medical University, Chongqing, China
| | - Wei Xu
- School of Public Health, Chongqing Medical University, Chongqing, China
| | - Cheng Lei
- School of Public Health, Chongqing Medical University, Chongqing, China
| | - Yang Pu
- State Key Laboratory of Ultrasound in Medicine and Engineering, College of Biomedical Engineering, Chongqing Medical University, Chongqing, China
| | - Yubo Zhang
- State Key Laboratory of Ultrasound in Medicine and Engineering, College of Biomedical Engineering, Chongqing Medical University, Chongqing, China
| | - Jingyu Zhang
- State Key Laboratory of Ultrasound in Medicine and Engineering, College of Biomedical Engineering, Chongqing Medical University, Chongqing, China
| | - Hongfan Yu
- School of Public Health, Chongqing Medical University, Chongqing, China
| | - Xueyao Su
- School of Public Health, Chongqing Medical University, Chongqing, China
| | - Yanyan Huang
- School of Public Health, Chongqing Medical University, Chongqing, China
| | - Ruoyan Gong
- School of Public Health, Chongqing Medical University, Chongqing, China
| | - Lijun Zhang
- School of Public Health, Chongqing Medical University, Chongqing, China
| | - Qiuling Shi
- State Key Laboratory of Ultrasound in Medicine and Engineering, College of Biomedical Engineering, Chongqing Medical University, Chongqing, China
- School of Public Health, Chongqing Medical University, Chongqing, China
| |
Collapse
|
2
|
Abbasi-Perez A, Alvarez-Mon MA, Donat-Vargas C, Ortega MA, Monserrat J, Perez-Gomez A, Alvarez-Mon M. Using Twitter Data Analysis to Understand the Perceptions, Beliefs, and Attitudes about Pharmacotherapy Used in Rheumatology: An Observational Study. Healthcare (Basel) 2023; 11:1526. [PMID: 37297665 PMCID: PMC10252953 DOI: 10.3390/healthcare11111526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 05/18/2023] [Accepted: 05/20/2023] [Indexed: 06/12/2023] Open
Abstract
Twitter has become an important platform for disseminating information about rheumatology drugs by patients, health professionals, institutions, and other users. The aim of this study was to analyze tweets related to 16 drugs used in rheumatology, including their volume, content, and type of user (patients, patients' relatives, health professionals, health institutions, pharmaceutical industry, general press, scientific journals and patients' associations), and to detect inappropriate medical content. A total of 8829 original tweets were obtained, with a random sample of 25% of the total number of tweets for each drug (at least 100 tweets) analyzed. Methotrexate (MTX) accounted for a quarter of all tweets, and there were significant differences in the proportion of tweets issued according to the type of user. Patients and their relatives mainly tweeted about MTX, while professionals, institutions, and patient associations posted more about TNF inhibitors. In contrast, the pharmaceutical industry focused on IL-17 inhibitors. Medical content prevailed in all drugs except anti-CD20 and IL-1 inhibitors and the most discussed medical topic was efficacy, followed by posology and adverse effects. Inappropriate or fake content was found to be very low. In conclusion, the majority of the tweets were about MTX, which is a first-line treatment for several diseases. The distribution of medical content varied according to the type of user. In contrast to other studies, the amount of medically inappropriate content was very low.
Collapse
Affiliation(s)
- Adrian Abbasi-Perez
- Service of Internal Medicine, Rheumatology and Autoimmune Diseases, University Hospital “Príncipe de Asturias”, 28805 Alcala de Henares, Spain; (A.A.-P.); (A.P.-G.); (M.A.-M.)
- Department of Medicine and Medical Specialities, Faculty of Medicine and Health Sciences, University of Alcala, 28805 Alcala de Henares, Spain; (M.A.O.); (J.M.)
| | - Miguel Angel Alvarez-Mon
- Department of Medicine and Medical Specialities, Faculty of Medicine and Health Sciences, University of Alcala, 28805 Alcala de Henares, Spain; (M.A.O.); (J.M.)
- Institute Ramon y Cajal for Health Research (IRYCIS), 28034 Madrid, Spain
| | - Carolina Donat-Vargas
- Cardiovascular and Nutritional Epidemiology, Institute of Environmental Medicine, Karolinska Institute, 17177 Stockholm, Sweden;
- IMDEA-Food Institute, Campus of International Excellence, Universidad Autónoma de Madrid, Consejo Superior de Investigaciones Científicas, 28049 Madrid, Spain
| | - Miguel A. Ortega
- Department of Medicine and Medical Specialities, Faculty of Medicine and Health Sciences, University of Alcala, 28805 Alcala de Henares, Spain; (M.A.O.); (J.M.)
- Institute Ramon y Cajal for Health Research (IRYCIS), 28034 Madrid, Spain
| | - Jorge Monserrat
- Department of Medicine and Medical Specialities, Faculty of Medicine and Health Sciences, University of Alcala, 28805 Alcala de Henares, Spain; (M.A.O.); (J.M.)
- Institute Ramon y Cajal for Health Research (IRYCIS), 28034 Madrid, Spain
| | - Ana Perez-Gomez
- Service of Internal Medicine, Rheumatology and Autoimmune Diseases, University Hospital “Príncipe de Asturias”, 28805 Alcala de Henares, Spain; (A.A.-P.); (A.P.-G.); (M.A.-M.)
| | - Melchor Alvarez-Mon
- Service of Internal Medicine, Rheumatology and Autoimmune Diseases, University Hospital “Príncipe de Asturias”, 28805 Alcala de Henares, Spain; (A.A.-P.); (A.P.-G.); (M.A.-M.)
- Department of Medicine and Medical Specialities, Faculty of Medicine and Health Sciences, University of Alcala, 28805 Alcala de Henares, Spain; (M.A.O.); (J.M.)
- Institute Ramon y Cajal for Health Research (IRYCIS), 28034 Madrid, Spain
| |
Collapse
|
3
|
Zhang J, Xu W, Lei C, Pu Y, Zhang Y, Zhang J, Yu H, Su X, Huang Y, Gong R, Zhang L, Shi Q. Using WeChat clinician-patient group communication data to identify symptom burdens in patients with uterine fibroids under focused ultrasound ablation surgery treatment :Qualitative Study (Preprint).. [DOI: 10.2196/preprints.43995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
BACKGROUND
Unlike research project-based health data collections(questionnaires, interviews), social media platforms, which allow patients to freely discuss their health status and obtain peer support.Previous literature has pointed out that both public and private social platforms can serve as data sources for analysing.
OBJECTIVE
This study aimed to use natural language processing (NLP) techniques to identify concerns regarding the postoperative quality of life and symptom burdens in uterine fibroids after focused ultrasound ablation surgery.
METHODS
Screenshots taken from the clinician-patient WeChat groups were converted into free texts using image text recognition technology and used as the research object of this study, which used regular expressions in Python to search for symptom burdens in over 900,000 words of WeChat group-chats associated with 408 patients in Chongqing Haifu Hospital diagnosed with uterine fibroids between 2010 and 2020. We first built a corpus of symptoms by manually coding 30% of the WeChat texts, and then used regular expressions to crawl symptom information from the remaining texts based on this corpus. We compared the results with a manual review (gold standard) of the same records. Then we analyzed the relationship between the population baseline data and conceptual symptoms, Quantitative and qualitative results were examined.
RESULTS
A total of 190,000 words of uterine fibroids patients' free text were finally obtained after data cleaning. A total of 408 patients were included in the study. The age of the patients was 39.94±6.81 years, and their BMI was 22.18±2.78 (kg/m^2). The median reporting times of the seven major symptoms were 21, 26, 57, 2, 18, 30, and 49 days. Results showed that patients with dysmenorrhea were younger(mean 38.26 (SD 7.05), P=.004) and slimmer (mean 22.37 (SD 3.81), P=.04), with lower fertility and parity (P<.05), and tended to stay longer in the hospital (P<.05). Logistic regression models identified preoperative menstrual duration (OR 1.14, 95% CI 5.86-6.37; P= .009), age of menophania (OR -1.02 ,95%CI 11.96-13.47,P=.03), and the number(OR 2.34,95% CI 1.45-1.83,P=.04) and size of fibroids(OR 0.12,95% CI 2.43-3.51,P=.04) as significant risk factors for postoperative symptoms.
CONCLUSIONS
Unstructured free texts from social media platforms extracted by NLP technology can be used for analysis, extracting the conceptual information about patients' HRQol,adopt personalized treatment for patients at different stages of recovery to improve the quality of life of patients. Python-based text mining of free-text data can accurately extract symptom burden administered and save considerable time compared to manual review, maximizing the utility of the extant information in population-based electronic health records for comparative effectiveness research.
CLINICALTRIAL
Collapse
|
4
|
Hswen Y, Yom-Tov E. Analysis of a Vaping-Associated Lung Injury Outbreak through Participatory Surveillance and Archival Internet Data. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18158203. [PMID: 34360495 PMCID: PMC8346109 DOI: 10.3390/ijerph18158203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 07/28/2021] [Accepted: 07/30/2021] [Indexed: 11/22/2022]
Abstract
The US Centers for Disease Control and Prevention alerted of a suspected outbreak of lung illness associated with using E-cigarette products in September 2019. At the time that the CDC published its alert little was known about the causes of the outbreak or who was at risk for it. Here we provide insights into the outbreak through analysis of passive reporting and participatory surveillance. We collected data about vaping habits and associated adverse reactions from four data sources pertaining to people in the USA: A participatory surveillance platform (YouVape), Reddit, Google Trends, and Bing. Data were analyzed to identify vaping behaviors and reported adverse events. These were correlated among sources and with prior reports. Data was obtained from 720 YouVape users, 4331 Reddit users, and over 1 million Bing users. Large geographic variation was observed across vaping products. Significant correlation was found among the data sources in reported adverse reactions. Models of participatory surveillance data found specific product and adverse reaction associations. Specifically, cannabidiol was found to be associated with fever, while tetrahydrocannabinol was found to be correlated with diarrhea. Our results demonstrate that utilization of different, complementary, online data sources provide a holistic view of vaping associated lung injury while augmenting traditional data sources.
Collapse
Affiliation(s)
- Yulin Hswen
- Department of Epidemiology and Biostatistics, University of California at San Francisco, San Francisco, CA 94158, USA;
- Bakar Computational Health Sciences Institute, University of California at San Francisco, San Francisco, CA 94143, USA
- Innovation Program, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Elad Yom-Tov
- Microsoft Research Israel, 3 Alan Turing Str., Herzeliya 4672415, Israel
- Faculty of Industrial Engineering and Management, Technion, Haifa 3200000, Israel
- Correspondence:
| |
Collapse
|
5
|
Gabarron E, Rivera-Romero O, Miron-Shatz T, Grainger R, Denecke K. Role of Participatory Health Informatics in Detecting and Managing Pandemics: Literature Review. Yearb Med Inform 2021; 30:200-209. [PMID: 33882600 PMCID: PMC8432992 DOI: 10.1055/s-0041-1726486] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
OBJECTIVES Using participatory health informatics (PHI) to detect disease outbreaks or learn about pandemics has gained interest in recent years. However, the role of PHI in understanding and managing pandemics, citizens' role in this context, and which methods are relevant for collecting and processing data are still unclear, as is which types of data are relevant. This paper aims to clarify these issues and explore the role of PHI in managing and detecting pandemics. METHODS Through a literature review we identified studies that explore the role of PHI in detecting and managing pandemics. Studies from five databases were screened: PubMed, CINAHL (Cumulative Index to Nursing and Allied Health Literature), IEEE Xplore, ACM (Association for Computing Machinery) Digital Library, and Cochrane Library. Data from studies fulfilling the eligibility criteria were extracted and synthesized narratively. RESULTS Out of 417 citations retrieved, 53 studies were included in this review. Most research focused on influenza-like illnesses or COVID-19 with at least three papers on other epidemics (Ebola, Zika or measles). The geographic scope ranged from global to concentrating on specific countries. Multiple processing and analysis methods were reported, although often missing relevant information. The majority of outcomes are reported for two application areas: crisis communication and detection of disease outbreaks. CONCLUSIONS For most diseases, the small number of studies prevented reaching firm conclusions about the utility of PHI in detecting and monitoring these disease outbreaks. For others, e.g., COVID-19, social media and online search patterns corresponded to disease patterns, and detected disease outbreak earlier than conventional public health methods, thereby suggesting that PHI can contribute to disease and pandemic monitoring.
Collapse
Affiliation(s)
- Elia Gabarron
- Norwegian Centre for E-health Research, University Hospital of North Norway, Troms⊘, Norway
| | | | - Talya Miron-Shatz
- Faculty of Business Administration, Ono Academic College, Israel
- Winton Centre for Risk and Evidence Communication, Cambridge University, England
| | - Rebecca Grainger
- Department of Medicine, University of Otago, Wellington, New Zealand
| | - Kerstin Denecke
- Institute for Medical Informatics, Bern University of Applied Sciences, Bern, Switzerland
| |
Collapse
|
6
|
Jiao J, Suarez GP, Fefferman NH. How public reaction to disease information across scales and the impacts of vector control methods influence disease prevalence and control efficacy. PLoS Comput Biol 2021; 17:e1008762. [PMID: 34181645 PMCID: PMC8270472 DOI: 10.1371/journal.pcbi.1008762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 07/09/2021] [Accepted: 05/28/2021] [Indexed: 11/10/2022] Open
Abstract
With the development of social media, the information about vector-borne disease incidence over broad spatial scales can cause demand for local vector control before local risk exists. Anticipatory intervention may still benefit local disease control efforts; however, infection risks are not the only focal concerns governing public demand for vector control. Concern for environmental contamination from pesticides and economic limitations on the frequency and magnitude of control measures also play key roles. Further, public concern may be focused more on ecological factors (i.e., controlling mosquito populations) or on epidemiological factors (i.e., controlling infection-carrying mosquitoes), which may lead to very different control outcomes. Here we introduced a generic Ross-MacDonald model, incorporating these factors under three spatial scales of disease information: local, regional, and global. We tailored and parameterized the model for Zika virus transmitted by Aedes aegypti mosquito. We found that sensitive reactivity caused by larger-scale incidence information could decrease average human infections per patch breeding capacity, however, the associated increase in total control effort plays a larger role, which leads to an overall decrease in control efficacy. The shift of focal concerns from epidemiological to ecological risk could relax the negative effect of the sensitive reactivity on control efficacy when mosquito breeding capacity populations are expected to be large. This work demonstrates that, depending on expected total mosquito breeding capacity population size, and weights of different focal concerns, large-scale disease information can reduce disease infections without lowering control efficacy. Our findings provide guidance for vector-control strategies by considering public reaction through social media.
Collapse
Affiliation(s)
- Jing Jiao
- National Institute for Mathematical and Biological Synthesis, The University of Tennessee, Knoxville, Tennessee, United States of America
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
| | - Gonzalo P. Suarez
- Department of Agriculture and Biological Engineering, University of Florida, Gainesville, Florida, United States of America
| | - Nina H. Fefferman
- National Institute for Mathematical and Biological Synthesis, The University of Tennessee, Knoxville, Tennessee, United States of America
- Ecology & Evolutionary Biology, The University of Tennessee, Knoxville, Tennessee, United States of America
| |
Collapse
|
7
|
Edo-Osagie O, De La Iglesia B, Lake I, Edeghere O. A scoping review of the use of Twitter for public health research. Comput Biol Med 2020; 122:103770. [PMID: 32502758 PMCID: PMC7229729 DOI: 10.1016/j.compbiomed.2020.103770] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 04/01/2020] [Accepted: 04/17/2020] [Indexed: 11/25/2022]
Abstract
Public health practitioners and researchers have used traditional medical databases to study and understand public health for a long time. Recently, social media data, particularly Twitter, has seen some use for public health purposes. Every large technological development in history has had an impact on the behaviour of society. The advent of the internet and social media is no different. Social media creates public streams of communication, and scientists are starting to understand that such data can provide some level of access into the people's opinions and situations. As such, this paper aims to review and synthesize the literature on Twitter applications for public health, highlighting current research and products in practice. A scoping review methodology was employed and four leading health, computer science and cross-disciplinary databases were searched. A total of 755 articles were retreived, 92 of which met the criteria for review. From the reviewed literature, six domains for the application of Twitter to public health were identified: (i) Surveillance; (ii) Event Detection; (iii) Pharmacovigilance; (iv) Forecasting; (v) Disease Tracking; and (vi) Geographic Identification. From our review, we were able to obtain a clear picture of the use of Twitter for public health. We gained insights into interesting observations such as how the popularity of different domains changed with time, the diseases and conditions studied and the different approaches to understanding each disease, which algorithms and techniques were popular with each domain, and more.
Collapse
Affiliation(s)
- Oduwa Edo-Osagie
- School of Computing Science, University of East Anglia, Norwich, NR4 7TJ, UK.
| | | | - Iain Lake
- School of Environmental Science, University of East Anglia, Norwich, NR4 7TJ, UK
| | - Obaghe Edeghere
- National Infection Service, Public Health England, Birmingham, B3 2PW, UK
| |
Collapse
|
8
|
Shahid F, Ony SH, Albi TR, Chellappan S, Vashistha A, Islam ABMAA. Learning from Tweets: Opportunities and Challenges to Inform Policy Making During Dengue Epidemic. ACTA ACUST UNITED AC 2020. [DOI: 10.1145/3392875] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Social media platforms are widely used by people to report, access, and share information during outbreaks and epidemics. Although government agencies and healthcare institutions in developed regions are increasingly relying on social media to develop epidemic forecasts and outbreak response, there is a limited understanding of how people in developing regions interact on social media during outbreaks and what useful insights this dataset could offer during public health crises. In this work, we examined 28,688 tweets to identify public health issues during dengue epidemic in Bangladesh and found several insights, such as irregularities in dengue diagnosis and treatment, shortage of blood supply for Rh negative blood groups, and high local transmission of dengue during Eid-ul-Adha, that impact disease preparedness and outbreak response. We discuss the opportunities and challenges in analyzing tweets and outline how government agencies and healthcare institutions can use social media health data to inform policy making during public health crises.
Collapse
|
9
|
Barros JM, Duggan J, Rebholz-Schuhmann D. The Application of Internet-Based Sources for Public Health Surveillance (Infoveillance): Systematic Review. J Med Internet Res 2020; 22:e13680. [PMID: 32167477 PMCID: PMC7101503 DOI: 10.2196/13680] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2019] [Revised: 09/18/2019] [Accepted: 11/26/2019] [Indexed: 12/30/2022] Open
Abstract
Background Public health surveillance is based on the continuous and systematic collection, analysis, and interpretation of data. This informs the development of early warning systems to monitor epidemics and documents the impact of intervention measures. The introduction of digital data sources, and specifically sources available on the internet, has impacted the field of public health surveillance. New opportunities enabled by the underlying availability and scale of internet-based sources (IBSs) have paved the way for novel approaches for disease surveillance, exploration of health communities, and the study of epidemic dynamics. This field and approach is also known as infodemiology or infoveillance. Objective This review aimed to assess research findings regarding the application of IBSs for public health surveillance (infodemiology or infoveillance). To achieve this, we have presented a comprehensive systematic literature review with a focus on these sources and their limitations, the diseases targeted, and commonly applied methods. Methods A systematic literature review was conducted targeting publications between 2012 and 2018 that leveraged IBSs for public health surveillance, outbreak forecasting, disease characterization, diagnosis prediction, content analysis, and health-topic identification. The search results were filtered according to previously defined inclusion and exclusion criteria. Results Spanning a total of 162 publications, we determined infectious diseases to be the preferred case study (108/162, 66.7%). Of the eight categories of IBSs (search queries, social media, news, discussion forums, websites, web encyclopedia, and online obituaries), search queries and social media were applied in 95.1% (154/162) of the reviewed publications. We also identified limitations in representativeness and biased user age groups, as well as high susceptibility to media events by search queries, social media, and web encyclopedias. Conclusions IBSs are a valuable proxy to study illnesses affecting the general population; however, it is important to characterize which diseases are best suited for the available sources; the literature shows that the level of engagement among online platforms can be a potential indicator. There is a necessity to understand the population’s online behavior; in addition, the exploration of health information dissemination and its content is significantly unexplored. With this information, we can understand how the population communicates about illnesses online and, in the process, benefit public health.
Collapse
Affiliation(s)
- Joana M Barros
- Insight Centre for Data Analytics, National University of Ireland Galway, Galway, Ireland.,School of Computer Science, National University of Ireland Galway, Galway, Ireland
| | - Jim Duggan
- School of Computer Science, National University of Ireland Galway, Galway, Ireland
| | | |
Collapse
|
10
|
Samaras L, García-Barriocanal E, Sicilia MA. Syndromic surveillance using web data: a systematic review. INNOVATION IN HEALTH INFORMATICS 2020. [PMCID: PMC7153324 DOI: 10.1016/b978-0-12-819043-2.00002-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
During the recent years, a lot of debate is taken place about the evolution of Smart Healthcare systems. Particularly, how these systems can help people improve human conditions of health, by taking advantages of the new Information and Communication Technologies (ICT), regarding early prediction and efficient treatment. The purpose of this study is to provide a systematic review of the current literature available that focuses on information systems on syndromic surveillance using web data. All published items concern articles, books, reviews, reports, conference announcements, and dissertations. We used a variation of PRISMA Statements methodology to conduct a systematic review. The review identifies the relevant published papers from the year 2004 to 2018, systematically includes and explores them to extract similarities, gaps, and conclusions on the research that has been done so far. The results presented concern the year, the examined disease, the web data source, the geographic location/country, and the data analysis method used. The results show that influenza is the most examined infectious disease. The internet tools most used are Twitter and Google. Regarding the geographical areas explored in the published papers, the most examined country is the United States, since many scientists come from this country. There is a significant growth of articles since 2009. There are also various statistical methods used to correlate the data retrieved from the internet to the data from national authorities. The conclusion of all researches is that the Web can be a useful tool for the detection of serious epidemics and for a creation of a syndromic surveillance system using the Web, since we can predict epidemics from web data before they are officially detected in population. With the advance of ICT, Smart Healthcare can benefit from the monitoring of epidemics and the early prediction of such a system, improving national or international health strategies and policy decision. This can be achieved through the provision of new technology tools to enhance health monitoring systems toward the new innovations of Smart Health or eHealth, even with the emerging technologies of Internet of Things. The challenges and impacts of an electronic system based on internet data include the social, medical, and technological disciplines. These can be further extended to Smart Healthcare, as the data streaming can provide with real-time information, awareness on epidemics and alerts for both patients or medical scientists. Finally, these new systems can help improve the standards of human life.
Collapse
|
11
|
Lami F, Asi W, Khistawi A, Jawad I. Syndromic Surveillance of Communicable Diseases in Mobile Clinics During the Arbaeenia Mass Gathering in Wassit Governorate, Iraq, in 2014: Cross-Sectional Study. JMIR Public Health Surveill 2019; 5:e10920. [PMID: 31593544 PMCID: PMC6803892 DOI: 10.2196/10920] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 12/01/2018] [Accepted: 12/23/2018] [Indexed: 11/30/2022] Open
Abstract
Background Arbaeenia is the largest religious mass gathering organized annually in Karbala city, Iraq, and is attended by 8-14 million people. Outbreaks of communicable diseases are a significant risk due to overcrowding and potential food and water contamination. Syndromic surveillance is often used for rapid detection and response to disease outbreaks. Objective This study was conducted to identify the main communicable diseases syndromes among pilgrims during the Arbaeenia mass gathering in Wassit governorate, Iraq, in 2014. Methods This cross-sectional study was conducted in the 40 mobile clinics established within Wassit governorates along the road to Karbala during the Arbaeenia mass gathering. Six communicable disease syndromes were selected: acute watery diarrhea, bloody diarrhea, fever and cough, vomiting with or without diarrhea, fever and bleeding tendency, and fever and rash. A simple questionnaire was used to directly gather basic demographics and the syndromic diagnosis from the attendees. Results A total of 87,865 patients attended the clinics during the 10-day period, with an average of 219 patients/clinic/day. Approximately 5% (3999) of the attendees had communicable diseases syndromes: of these, 1693 (42%) had fever and cough, 1144 (29%) had acute diarrhea, 1062 (27%) presented with vomiting with/without diarrhea, and 100 (2%) had bloody diarrhea. The distribution of the syndromes did not vary by age or gender. Stool specimen cultures for Vibrio cholerae performed for 120 patients with acute diarrhea were all negative. Conclusions Syndromic surveillance was useful in determining the main communicable diseases encountered during the mass gathering. Expansion of this surveillance to other governorates and the use of mobile technology can help in timely detection and response to communicable disease outbreaks.
Collapse
Affiliation(s)
- Faris Lami
- Department of Community and Family Medicine, College of Medicine, University of Baghdad, Baghdad, Iraq
| | - Wejdan Asi
- Wasit Directorate of Health, Iraq Ministry of Health, Wasit, Iraq
| | - Adnan Khistawi
- Directorate of Public Health, Iraq Ministry of Health, Baghdad, Iraq
| | - Iman Jawad
- Wasit Directorate of Health, Iraq Ministry of Health, Wasit, Iraq
| |
Collapse
|
12
|
Raghupathi V, Zhou Y, Raghupathi W. Exploring Big Data Analytic Approaches to Cancer Blog Text Analysis. INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS 2019. [DOI: 10.4018/ijhisi.2019100101] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this article, the authors explore the potential of a big data analytics approach to unstructured text analytics of cancer blogs. The application is developed using Cloudera platform's Hadoop MapReduce framework. It uses several text analytics algorithms, including word count, word association, clustering, and classification, to identify and analyze the patterns and keywords in cancer blog postings. This article establishes an exploratory approach to involving big data analytics methods in developing text analytics applications for the analysis of cancer blogs. Additional insights are extracted through various means, including the development of categories or keywords contained in the blogs, the development of a taxonomy, and the examination of relationships among the categories. The application has the potential for generalizability and implementation with health content in other blogs and social media. It can provide insight and decision support for cancer management and facilitate efficient and relevant searches for information related to cancer.
Collapse
Affiliation(s)
- Viju Raghupathi
- Koppelman School of Business, Brooklyn College of the City University of New York, Brooklyn, USA
| | - Yilu Zhou
- Gabelli School of Business, Fordham University, New York, USA
| | | |
Collapse
|
13
|
Characterizing Consumer Behavior in Leveraging Social Media for E-Patient and Health-Related Activities. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2019; 16:ijerph16183348. [PMID: 31514276 PMCID: PMC6765822 DOI: 10.3390/ijerph16183348] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Revised: 09/06/2019] [Accepted: 09/07/2019] [Indexed: 11/17/2022]
Abstract
The emergence of e-patients has encouraged consumers, people who are non-medical experts, to be more engaged in healthcare needs by utilizing online sources via social media. However, the nature of social media and regulation issues have caused concerns for the reliability and validity of the shared information. These phenomena shape consumers behavior in leveraging social media for e-patient activities. This study investigates consumer behavior using an integrated model based on the Unified Theory of Acceptance and Use of Technology (UTAUT) and the Protection Motivation Theory (PMT). The data collected from the participants (N = 312) was analyzed using partial least square structural equation modelling. The results showed that behavioral intention to use social media for e-patient activities was significantly affected by performance expectancy, effort expectancy, perceived severity, perceived susceptibility, and response efficacy; and that behavioral intention corresponded positively to usage intention. In addition, the results also indicate that the intention to use social media for health-related purposes is driven by awareness of preventing health problems and attempts to reduce the risk of developing an illness. Based on findings, this study recommends strategies and initiatives to optimize social media for promoting a healthy lifestyle and educating society about public health and healthcare management.
Collapse
|
14
|
Yuan M, Boston-Fisher N, Luo Y, Verma A, Buckeridge DL. A systematic review of aberration detection algorithms used in public health surveillance. J Biomed Inform 2019; 94:103181. [PMID: 31014979 DOI: 10.1016/j.jbi.2019.103181] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 04/16/2019] [Accepted: 04/17/2019] [Indexed: 12/21/2022]
Abstract
The algorithms used for detecting anomalies have evolved substantially over the last decade to take advantage of advances in informatics and to accommodate changes in surveillance data. We identified 145 studies since 2007 that evaluated statistical methods used to detect aberrations in public health surveillance data. For each study, we classified the analytic methods and reviewed the evaluation metrics. We also summarized the practical usage of the detection algorithms in public health surveillance systems worldwide. Traditional methods (e.g., control charts, linear regressions) were the focus of most evaluation studies and continue to be used commonly in practice. There was, however, an increase in the number of studies using forecasting methods and studies applying machine learning methods, hidden Markov models, and Bayesian framework to multivariate datasets. Evaluation studies demonstrated improved accuracy with more sophisticated methods, but these methods do not appear to be used widely in public health practice.
Collapse
Affiliation(s)
- Mengru Yuan
- Clinical and Health Informatics Research Group, McGill University, 1140 Pine Avenue West, Montreal, QC H3A 1A3, Canada
| | - Nikita Boston-Fisher
- Clinical and Health Informatics Research Group, McGill University, 1140 Pine Avenue West, Montreal, QC H3A 1A3, Canada
| | - Yu Luo
- Clinical and Health Informatics Research Group, McGill University, 1140 Pine Avenue West, Montreal, QC H3A 1A3, Canada
| | - Aman Verma
- Clinical and Health Informatics Research Group, McGill University, 1140 Pine Avenue West, Montreal, QC H3A 1A3, Canada
| | - David L Buckeridge
- Clinical and Health Informatics Research Group, McGill University, 1140 Pine Avenue West, Montreal, QC H3A 1A3, Canada.
| |
Collapse
|
15
|
Lebwohl B, Yom-Tov E. Symptoms Prompting Interest in Celiac Disease and the Gluten-Free Diet: Analysis of Internet Search Term Data. J Med Internet Res 2019; 21:e13082. [PMID: 30958273 PMCID: PMC6475820 DOI: 10.2196/13082] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Revised: 02/05/2019] [Accepted: 02/11/2019] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Celiac disease, a common immune-based disease triggered by gluten, has diverse clinical manifestations, and the relative distribution of symptoms leading to diagnosis has not been well characterized in the population. OBJECTIVE This study aimed to use search engine data to identify a set of symptoms and conditions that would identify individuals at elevated likelihood of a subsequent celiac disease diagnosis. We also measured the relative prominence of these search terms before versus after a search related to celiac disease. METHODS We extracted English-language queries submitted to the Bing search engine in the United States and identified those who submitted a new celiac-related query during a 1-month period, without any celiac-related queries in the preceding 9 months. We compared the ratio between the number of times that each symptom or condition was asked in the 14 days preceding the first celiac-related query of each person and the number of searches for that same symptom or condition in the 14 days after the celiac-related query. RESULTS We identified 90,142 users who made a celiac-related query, of whom 6528 (7%) exhibited sustained interest, defined as making a query on more than 1 day. Though a variety of symptoms and associated conditions were also queried before a celiac-related query, the maximum area under the receiver operating characteristic curve was 0.53. The symptom most likely to be queried more before than after a celiac-related query was diarrhea (query ratio [QR] 1.28). Extraintestinal symptoms queried before a celiac disease query included headache (QR 1.26), anxiety (QR 1.10), depression (QR 1.03), and attention-deficit hyperactivity disorder (QR 1.64). CONCLUSIONS We found an increase in antecedent searches for symptoms known to be associated with celiac disease, a rise in searches for depression and anxiety, and an increase in symptoms that are associated with celiac disease but may not be reported to health care providers. The protean clinical manifestations of celiac disease are reflected in the diffuse nature of antecedent internet queries of those interested in celiac disease, underscoring the challenge of effective case-finding strategies.
Collapse
Affiliation(s)
- Benjamin Lebwohl
- Celiac Disease Center, Columbia University, New York, NY, United States
| | - Elad Yom-Tov
- Microsoft Research, Herzeliya, Israel.,Technion, Haifa, Israel
| |
Collapse
|
16
|
Jeong S, Kuk S, Kim H. A Smartphone Magnetometer-Based Diagnostic Test for Automatic Contact Tracing in Infectious Disease Epidemics. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2019; 7:20734-20747. [PMID: 34192097 PMCID: PMC7309220 DOI: 10.1109/access.2019.2895075] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Accepted: 01/19/2019] [Indexed: 05/03/2023]
Abstract
Smartphone magnetometer readings exhibit high linear correlation when two phones coexist within a short distance. Thus, the detected coexistence can serve as a proxy for close human contact events, and one can conceive using it as a possible automatic tool to modernize the contact tracing in infectious disease epidemics. This paper investigates how good a diagnostic test it would be, by evaluating the discriminative and predictive power of the smartphone magnetometer-based contact detection in multiple measures. Based on the sensitivity, specificity, likelihood ratios, and diagnostic odds ratios, we find that the decision made by the smartphone magnetometer-based test can be accurate in telling contacts from no contacts. Furthermore, through the evaluation process, we determine the appropriate range of compared trace segment sizes and the correlation cutoff values that we should use in such diagnostic tests.
Collapse
Affiliation(s)
- Seungyeon Jeong
- Department of Computer Science and EngineeringKorea UniversitySeoul02841South Korea
| | - Seungho Kuk
- Department of Computer Science and EngineeringKorea UniversitySeoul02841South Korea
| | - Hyogon Kim
- Department of Computer Science and EngineeringKorea UniversitySeoul02841South Korea
| |
Collapse
|
17
|
Schwab-Reese LM, Hovdestad W, Tonmyr L, Fluke J. The potential use of social media and other internet-related data and communications for child maltreatment surveillance and epidemiological research: Scoping review and recommendations. CHILD ABUSE & NEGLECT 2018; 85:187-201. [PMID: 29366596 PMCID: PMC7112406 DOI: 10.1016/j.chiabu.2018.01.014] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 12/06/2017] [Accepted: 01/12/2018] [Indexed: 05/12/2023]
Abstract
Collecting child maltreatment data is a complicated undertaking for many reasons. As a result, there is an interest by child maltreatment researchers to develop methodologies that allow for the triangulation of data sources. To better understand how social media and internet-based technologies could contribute to these approaches, we conducted a scoping review to provide an overview of social media and internet-based methodologies for health research, to report results of evaluation and validation research on these methods, and to highlight studies with potential relevance to child maltreatment research and surveillance. Many approaches were identified in the broad health literature; however, there has been limited application of these approaches to child maltreatment. The most common use was recruiting participants or engaging existing participants using online methods. From the broad health literature, social media and internet-based approaches to surveillance and epidemiologic research appear promising. Many of the approaches are relatively low cost and easy to implement without extensive infrastructure, but there are also a range of limitations for each method. Several methods have a mixed record of validation and sources of error in estimation are not yet understood or predictable. In addition to the problems relevant to other health outcomes, child maltreatment researchers face additional challenges, including the complex ethical issues associated with both internet-based and child maltreatment research. If these issues are adequately addressed, social media and internet-based technologies may be a promising approach to reducing some of the limitations in existing child maltreatment data.
Collapse
Affiliation(s)
- Laura M Schwab-Reese
- The Kempe Center for The Prevention and Treatment of Child Abuse and Neglect, University of Colorado, Anschutz Medical Campus, 13123 E 16th Ave., Aurora, CO 80045, USA.
| | - Wendy Hovdestad
- Public Health Agency of Canada, 785 Carling Ave., Ottawa, ON, K1A 0K9, Canada
| | - Lil Tonmyr
- Public Health Agency of Canada, 785 Carling Ave., Ottawa, ON, K1A 0K9, Canada
| | - John Fluke
- The Kempe Center for The Prevention and Treatment of Child Abuse and Neglect, University of Colorado, Anschutz Medical Campus, 13123 E 16th Ave., Aurora, CO 80045, USA
| |
Collapse
|
18
|
Gupta R, Gupta M, Calix RA, Bernard GR. Identifying personal health experience tweets with deep neural networks. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2018; 2017:1174-1177. [PMID: 29060084 DOI: 10.1109/embc.2017.8037039] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Twitter, as a social media platform, has become an increasingly useful data source for health surveillance studies, and personal health experiences shared on Twitter provide valuable information to the surveillance. Twitter data are known for their irregular usages of languages and informal short texts due to the 140 character limit, and for their noisiness such that majority of the posts are irrelevant to any particular health surveillance. These factors pose challenges in identifying personal health experience tweets from the Twitter data. In this study, we designed deep neural networks with 3 different architectural configurations, and after training them with a corpus of 8,770 annotated tweets, we used them to predict personal experience tweets from a set of 821 annotate tweets. Our results demonstrated a significant amount of improvement in predicting personal health experience tweets by deep neural networks over that by conventional classifiers: 37.5% in accuracy, 31.1% in precision, and 53.6% in recall. We believe that our method can be utilized in various health surveillance studies using Twitter as a data source.
Collapse
|
19
|
Oldroyd RA, Morris MA, Birkin M. Identifying Methods for Monitoring Foodborne Illness: Review of Existing Public Health Surveillance Techniques. JMIR Public Health Surveill 2018; 4:e57. [PMID: 29875090 PMCID: PMC6010836 DOI: 10.2196/publichealth.8218] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2017] [Revised: 01/16/2018] [Accepted: 01/31/2018] [Indexed: 11/13/2022] Open
Abstract
Background Traditional methods of monitoring foodborne illness are associated with problems of untimeliness and underreporting. In recent years, alternative data sources such as social media data have been used to monitor the incidence of disease in the population (infodemiology and infoveillance). These data sources prove timelier than traditional general practitioner data, they can help to fill the gaps in the reporting process, and they often include additional metadata that is useful for supplementary research. Objective The aim of the study was to identify and formally analyze research papers using consumer-generated data, such as social media data or restaurant reviews, to quantify a disease or public health ailment. Studies of this nature are scarce within the food safety domain, therefore identification and understanding of transferrable methods in other health-related fields are of particular interest. Methods Structured scoping methods were used to identify and analyze primary research papers using consumer-generated data for disease or public health surveillance. The title, abstract, and keyword fields of 5 databases were searched using predetermined search terms. A total of 5239 papers matched the search criteria, of which 145 were taken to full-text review—62 papers were deemed relevant and were subjected to data characterization and thematic analysis. Results The majority of studies (40/62, 65%) focused on the surveillance of influenza-like illness. Only 10 studies (16%) used consumer-generated data to monitor outbreaks of foodborne illness. Twitter data (58/62, 94%) and Yelp reviews (3/62, 5%) were the most commonly used data sources. Studies reporting high correlations against baseline statistics used advanced statistical and computational approaches to calculate the incidence of disease. These include classification and regression approaches, clustering approaches, and lexicon-based approaches. Although they are computationally intensive due to the requirement of training data, studies using classification approaches reported the best performance. Conclusions By analyzing studies in digital epidemiology, computer science, and public health, this paper has identified and analyzed methods of disease monitoring that can be transferred to foodborne disease surveillance. These methods fall into 4 main categories: basic approach, classification and regression, clustering approaches, and lexicon-based approaches. Although studies using a basic approach to calculate disease incidence generally report good performance against baseline measures, they are sensitive to chatter generated by media reports. More computationally advanced approaches are required to filter spurious messages and protect predictive systems against false alarms. Research using consumer-generated data for monitoring influenza-like illness is expansive; however, research regarding the use of restaurant reviews and social media data in the context of food safety is limited. Considering the advantages reported in this review, methods using consumer-generated data for foodborne disease surveillance warrant further investment.
Collapse
Affiliation(s)
- Rachel A Oldroyd
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.,School of Geography, University of Leeds, Leeds, United Kingdom
| | - Michelle A Morris
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.,School of Medicine, University of Leeds, Leeds, United Kingdom
| | - Mark Birkin
- Leeds Institute for Data Analytics, University of Leeds, Leeds, United Kingdom.,School of Geography, University of Leeds, Leeds, United Kingdom
| |
Collapse
|
20
|
Clyne W, Pezaro S, Deeny K, Kneafsey R. Using Social Media to Generate and Collect Primary Data: The #ShowsWorkplaceCompassion Twitter Research Campaign. JMIR Public Health Surveill 2018; 4:e41. [PMID: 29685866 PMCID: PMC5938572 DOI: 10.2196/publichealth.7686] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2017] [Revised: 11/24/2017] [Accepted: 02/28/2018] [Indexed: 11/13/2022] Open
Abstract
Background Compassion is a core value embedded in the concept of quality in healthcare. The need for compassion toward healthcare staff in the workplace, for their own health and well-being and also to enable staff to deliver compassionate care for patients, is increasingly understood. However, we do not currently know how healthcare staff understand and characterize compassion toward themselves as opposed to patients. Objective The aim of this study was to use social media for the generation and collection of primary data to gain understanding of the concept of workplace compassion. Methods Tweets that contained the hashtag #ShowsWorkplaceCompassion were collected from Twitter and analyzed. The study took place between April 21 and May 21, 2016. Participants were self-selecting users of the social media service Twitter. The study was promoted by a number of routes: the National Health Service (NHS) England website, the personal Twitter accounts of the research team, internal NHS England communications, and via social media sharing. Participants were asked to contribute their views about what activities, actions, policies, philosophies or approaches demonstrate workplace compassion in healthcare using the hashtag #ShowsWorkplaceCompassion. All tweets including the research hashtag #ShowsWorkplaceCompassion were extracted from Twitter and studied using content analysis. Data concerning the frequency, nature, origin, and location of Web-based engagement with the research campaign were collected using Bitly (Bitly, Inc, USA) and Symplur (Symplur LLC, USA) software. Results A total of 260 tweets were analyzed. Of the 251 statements within the tweets that were coded, 37.8% (95/251) of the statements concerned Leadership and Management aspects of workplace compassion, 29.5% (74/251) were grouped under the theme related to Values and Culture, 17.5% (44/251) of the statements related to Personalized Policies and Procedures that support workplace compassion, and 15.2% (38/251) of the statements concerned Activities and Actions that show workplace compassion. Content analysis showed that small acts of kindness, an embedded organizational culture of caring for one another, and recognition of the emotional and physical impact of healthcare work were the most frequently mentioned characteristics of workplace compassion in healthcare. Conclusions This study presents a new and innovative research approach using Twitter. Although previous research has analyzed the nature and pattern of tweets retrospectively, this study used Twitter to both recruit participants and collect primary data.
Collapse
Affiliation(s)
- Wendy Clyne
- Hope for the Community, Community Interest Company, Coventry, United Kingdom
| | - Sally Pezaro
- School of Nursing, Midwifery and Health, Coventry University, Coventry, United Kingdom
| | - Karen Deeny
- Hope for the Community, Community Interest Company, Coventry, United Kingdom
| | - Rosie Kneafsey
- School of Nursing, Midwifery and Health, Coventry University, Coventry, United Kingdom
| |
Collapse
|
21
|
Mejova Y, Weber I, Fernandez-Luque L. Online Health Monitoring using Facebook Advertisement Audience Estimates in the United States: Evaluation Study. JMIR Public Health Surveill 2018; 4:e30. [PMID: 29592849 PMCID: PMC5895920 DOI: 10.2196/publichealth.7217] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Revised: 04/25/2017] [Accepted: 10/08/2017] [Indexed: 01/08/2023] Open
Abstract
Background Facebook, the most popular social network with over one billion daily users, provides rich opportunities for its use in the health domain. Though much of Facebook’s data are not available to outsiders, the company provides a tool for estimating the audience of Facebook advertisements, which includes aggregated information on the demographics and interests, such as weight loss or dieting, of Facebook users. This paper explores the potential uses of Facebook ad audience estimates for eHealth by studying the following: (1) for what type of health conditions prevalence estimates can be obtained via social media and (2) what type of marker interests are useful in obtaining such estimates, which can then be used for recruitment within online health interventions. Objective The objective of this study was to understand the limitations and capabilities of using Facebook ad audience estimates for public health monitoring and as a recruitment tool for eHealth interventions. Methods We use the Facebook Marketing application programming interface to correlate estimated sizes of audiences having health-related interests with public health data. Using several study cases, we identify both potential benefits and challenges in using this tool. Results We find several limitations in using Facebook ad audience estimates, for example, using placebo interest estimates to control for background level of user activity on the platform. Some Facebook interests such as plus-size clothing show encouraging levels of correlation (r=.74) across the 50 US states; however, we also sometimes find substantial correlations with the placebo interests such as r=.68 between interest in Technology and Obesity prevalence. Furthermore, we find demographic-specific peculiarities in the interests on health-related topics. Conclusions Facebook’s advertising platform provides aggregate data for more than 190 million US adults. We show how disease-specific marker interests can be used to model prevalence rates in a simple and intuitive manner. However, we also illustrate that building effective marker interests involves some trial-and-error, as many details about Facebook’s black box remain opaque.
Collapse
Affiliation(s)
- Yelena Mejova
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Ingmar Weber
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | | |
Collapse
|
22
|
Lu FS, Hou S, Baltrusaitis K, Shah M, Leskovec J, Sosic R, Hawkins J, Brownstein J, Conidi G, Gunn J, Gray J, Zink A, Santillana M. Accurate Influenza Monitoring and Forecasting Using Novel Internet Data Streams: A Case Study in the Boston Metropolis. JMIR Public Health Surveill 2018; 4:e4. [PMID: 29317382 PMCID: PMC5780615 DOI: 10.2196/publichealth.8950] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Revised: 11/08/2017] [Accepted: 11/12/2017] [Indexed: 11/30/2022] Open
Abstract
Background Influenza outbreaks pose major challenges to public health around the world, leading to thousands of deaths a year in the United States alone. Accurate systems that track influenza activity at the city level are necessary to provide actionable information that can be used for clinical, hospital, and community outbreak preparation. Objective Although Internet-based real-time data sources such as Google searches and tweets have been successfully used to produce influenza activity estimates ahead of traditional health care–based systems at national and state levels, influenza tracking and forecasting at finer spatial resolutions, such as the city level, remain an open question. Our study aimed to present a precise, near real-time methodology capable of producing influenza estimates ahead of those collected and published by the Boston Public Health Commission (BPHC) for the Boston metropolitan area. This approach has great potential to be extended to other cities with access to similar data sources. Methods We first tested the ability of Google searches, Twitter posts, electronic health records, and a crowd-sourced influenza reporting system to detect influenza activity in the Boston metropolis separately. We then adapted a multivariate dynamic regression method named ARGO (autoregression with general online information), designed for tracking influenza at the national level, and showed that it effectively uses the above data sources to monitor and forecast influenza at the city level 1 week ahead of the current date. Finally, we presented an ensemble-based approach capable of combining information from models based on multiple data sources to more robustly nowcast as well as forecast influenza activity in the Boston metropolitan area. The performances of our models were evaluated in an out-of-sample fashion over 4 influenza seasons within 2012-2016, as well as a holdout validation period from 2016 to 2017. Results Our ensemble-based methods incorporating information from diverse models based on multiple data sources, including ARGO, produced the most robust and accurate results. The observed Pearson correlations between our out-of-sample flu activity estimates and those historically reported by the BPHC were 0.98 in nowcasting influenza and 0.94 in forecasting influenza 1 week ahead of the current date. Conclusions We show that information from Internet-based data sources, when combined using an informed, robust methodology, can be effectively used as early indicators of influenza activity at fine geographic resolutions.
Collapse
Affiliation(s)
- Fred Sun Lu
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
| | - Suqin Hou
- Harvard Chan School of Public Health, Harvard University, Boston, MA, United States
| | - Kristin Baltrusaitis
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, United States
| | - Manan Shah
- Computer Science Department, Stanford University, Stanford, CA, United States
| | - Jure Leskovec
- Computer Science Department, Stanford University, Stanford, CA, United States.,Chan Zuckerberg Biohub, San Francisco, CA, United States
| | - Rok Sosic
- Computer Science Department, Stanford University, Stanford, CA, United States
| | - Jared Hawkins
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States.,Department of Pediatrics, Harvard Medical School, Boston, MA, United States
| | - John Brownstein
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States.,Department of Pediatrics, Harvard Medical School, Boston, MA, United States
| | | | - Julia Gunn
- Boston Public Health Commission, Boston, MA, United States
| | - Josh Gray
- athenaResearch, athenahealth, Watertown, MA, United States
| | - Anna Zink
- athenaResearch, athenahealth, Watertown, MA, United States
| | - Mauricio Santillana
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States.,Department of Pediatrics, Harvard Medical School, Boston, MA, United States
| |
Collapse
|
23
|
Fleischauer AT, Gaines J. Enhancing Surveillance for Mass Gatherings: The Role of Syndromic Surveillance. Public Health Rep 2018; 132:95S-98S. [PMID: 28692398 DOI: 10.1177/0033354917706343] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Affiliation(s)
- Aaron T Fleischauer
- 1 Office of Public Health Preparedness and Response, Centers for Disease Control and Prevention, Atlanta, GA, USA.,2 Division of Public Health, North Carolina Department of Health and Human Services, Raleigh, NC, USA
| | - Joanna Gaines
- 3 Division of Global Migration and Quarantine, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| |
Collapse
|
24
|
de Lusignan S, Shinneman S, Yonova I, van Vlymen J, Elliot AJ, Bolton F, Smith GE, O'Brien S. An Ontology to Improve Transparency in Case Definition and Increase Case Finding of Infectious Intestinal Disease: Database Study in English General Practice. JMIR Med Inform 2017; 5:e34. [PMID: 28958989 PMCID: PMC5639210 DOI: 10.2196/medinform.7641] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Revised: 06/20/2017] [Accepted: 06/27/2017] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Infectious intestinal disease (IID) has considerable health impact; there are 2 billion cases worldwide resulting in 1 million deaths and 78.7 million disability-adjusted life years lost. Reported IID incidence rates vary and this is partly because terms such as "diarrheal disease" and "acute infectious gastroenteritis" are used interchangeably. Ontologies provide a method of transparently comparing case definitions and disease incidence rates. OBJECTIVE This study sought to show how differences in case definition in part account for variation in incidence estimates for IID and how an ontological approach provides greater transparency to IID case finding. METHODS We compared three IID case definitions: (1) Royal College of General Practitioners Research and Surveillance Centre (RCGP RSC) definition based on mapping to the Ninth International Classification of Disease (ICD-9), (2) newer ICD-10 definition, and (3) ontological case definition. We calculated incidence rates and examined the contribution of four supporting concepts related to IID: symptoms, investigations, process of care (eg, notification to public health authorities), and therapies. We created a formal ontology using ontology Web language. RESULTS The ontological approach identified 5712 more cases of IID than the ICD-10 definition and 4482 more than the RCGP RSC definition from an initial cohort of 1,120,490. Weekly incidence using the ontological definition was 17.93/100,000 (95% CI 15.63-20.41), whereas for the ICD-10 definition the rate was 8.13/100,000 (95% CI 6.70-9.87), and for the RSC definition the rate was 10.24/100,000 (95% CI 8.55-12.12). Codes from the four supporting concepts were generally consistent across our three IID case definitions: 37.38% (3905/10,448) (95% CI 36.16-38.5) for the ontological definition, 38.33% (2287/5966) (95% CI 36.79-39.93) for the RSC definition, and 40.82% (1933/4736) (95% CI 39.03-42.66) for the ICD-10 definition. The proportion of laboratory results associated with a positive test result was 19.68% (546/2775). CONCLUSIONS The standard RCGP RSC definition of IID, and its mapping to ICD-10, underestimates disease incidence. The ontological approach identified a larger proportion of new IID cases; the ontology divides contributory elements and enables transparency and comparison of rates. Results illustrate how improved diagnostic coding of IID combined with an ontological approach to case definition would provide a clearer picture of IID in the community, better inform GPs and public health services about circulating disease, and empower them to respond. We need to improve the Pathology Bounded Code List (PBCL) currently used by laboratories to electronically report results. Given advances in stool microbiology testing with a move to nonculture, PCR-based methods, the way microbiology results are reported and coded via PBCL needs to be reviewed and modernized.
Collapse
Affiliation(s)
- Simon de Lusignan
- Section of Clinical Medicine and Ageing, Department of Clinical and Experimental Medicine, University of Surrey, Guildford, United Kingdom
- Royal College of General Practitioners, Research and Surveillance Centre, London, United Kingdom
| | - Stacy Shinneman
- Section of Clinical Medicine and Ageing, Department of Clinical and Experimental Medicine, University of Surrey, Guildford, United Kingdom
| | - Ivelina Yonova
- Section of Clinical Medicine and Ageing, Department of Clinical and Experimental Medicine, University of Surrey, Guildford, United Kingdom
- Royal College of General Practitioners, Research and Surveillance Centre, London, United Kingdom
| | - Jeremy van Vlymen
- Section of Clinical Medicine and Ageing, Department of Clinical and Experimental Medicine, University of Surrey, Guildford, United Kingdom
| | - Alex J Elliot
- Real-time Syndromic Surveillance Team, National Infection Service, Public Health England, Birmingham, United Kingdom
| | - Frederick Bolton
- Epidemiology and Population Health, University of Liverpool, Liverpool, United Kingdom
| | - Gillian E Smith
- Real-time Syndromic Surveillance Team, National Infection Service, Public Health England, Birmingham, United Kingdom
| | - Sarah O'Brien
- Institute of Psychology Health and Society, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
25
|
|
26
|
Utility and potential of rapid epidemic intelligence from internet-based sources. Int J Infect Dis 2017; 63:77-87. [PMID: 28765076 DOI: 10.1016/j.ijid.2017.07.020] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2017] [Revised: 07/19/2017] [Accepted: 07/21/2017] [Indexed: 12/15/2022] Open
Abstract
OBJECTIVES Rapid epidemic detection is an important objective of surveillance to enable timely intervention, but traditional validated surveillance data may not be available in the required timeframe for acute epidemic control. Increasing volumes of data on the Internet have prompted interest in methods that could use unstructured sources to enhance traditional disease surveillance and gain rapid epidemic intelligence. We aimed to summarise Internet-based methods that use freely-accessible, unstructured data for epidemic surveillance and explore their timeliness and accuracy outcomes. METHODS Steps outlined in the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist were used to guide a systematic review of research related to the use of informal or unstructured data by Internet-based intelligence methods for surveillance. RESULTS We identified 84 articles published between 2006-2016 relating to Internet-based public health surveillance methods. Studies used search queries, social media posts and approaches derived from existing Internet-based systems for early epidemic alerts and real-time monitoring. Most studies noted improved timeliness compared to official reporting, such as in the 2014 Ebola epidemic where epidemic alerts were generated first from ProMED-mail. Internet-based methods showed variable correlation strength with official datasets, with some methods showing reasonable accuracy. CONCLUSION The proliferation of publicly available information on the Internet provided a new avenue for epidemic intelligence. Methodologies have been developed to collect Internet data and some systems are already used to enhance the timeliness of traditional surveillance systems. To improve the utility of Internet-based systems, the key attributes of timeliness and data accuracy should be included in future evaluations of surveillance systems.
Collapse
|
27
|
Influenza detection and prediction algorithms: comparative accuracy trial in Östergötland county, Sweden, 2008–2012. Epidemiol Infect 2017; 145:2166-2175. [DOI: 10.1017/s0950268817001005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
SUMMARYMethods for the detection of influenza epidemics and prediction of their progress have seldom been comparatively evaluated using prospective designs. This study aimed to perform a prospective comparative trial of algorithms for the detection and prediction of increased local influenza activity. Data on clinical influenza diagnoses recorded by physicians and syndromic data from a telenursing service were used. Five detection and three prediction algorithms previously evaluated in public health settings were calibrated and then evaluated over 3 years. When applied on diagnostic data, only detection using the Serfling regression method and prediction using the non-adaptive log-linear regression method showed acceptable performances during winter influenza seasons. For the syndromic data, none of the detection algorithms displayed a satisfactory performance, while non-adaptive log-linear regression was the best performing prediction method. We conclude that evidence was found for that available algorithms for influenza detection and prediction display satisfactory performance when applied on local diagnostic data during winter influenza seasons. When applied on local syndromic data, the evaluated algorithms did not display consistent performance. Further evaluations and research on combination of methods of these types in public health information infrastructures for ‘nowcasting’ (integrated detection and prediction) of influenza activity are warranted.
Collapse
|
28
|
Riccardo F, Manso MD, Caporali MG, Napoli C, Linge JP, Mantica E, Verile M, Piatti A, Pompa MG, Vellucci L, Costanzo V, Bastiampillai AJ, Gabrielli E, Gramegna M, Declich S. Event-Based Surveillance During EXPO Milan 2015: Rationale, Tools, Procedures, and Initial Results. Health Secur 2017; 14:161-72. [PMID: 27314656 PMCID: PMC4931307 DOI: 10.1089/hs.2015.0075] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
More than 21 million participants attended EXPO Milan from May to October 2015, making it one of the largest protracted mass gathering events in Europe. Given the expected national and international population movement and health security issues associated with this event, Italy fully implemented, for the first time, an event-based surveillance (EBS) system focusing on naturally occurring infectious diseases and the monitoring of biological agents with potential for intentional release. The system started its pilot phase in March 2015 and was fully operational between April and November 2015. In order to set the specific objectives of the EBS system, and its complementary role to indicator-based surveillance, we defined a list of priority diseases and conditions. This list was designed on the basis of the probability and possible public health impact of infectious disease transmission, existing statutory surveillance systems in place, and any surveillance enhancements during the mass gathering event. This article reports the methodology used to design the EBS system for EXPO Milan and the results of 8 months of surveillance. More than 21 million participants attended EXPO Milan from May to October 2015, making it one of the largest protracted mass gathering events in Europe. Given the expected national and international population movement and health security issues associated with this event, Italy fully implemented, for the first time, an event-based surveillance system focusing on naturally occurring infectious diseases and the monitoring of biological agents with potential for intentional release. This article reports the methodology used to design the event-based surveillance system for EXPO Milan and the results of 8 months of surveillance.
Collapse
|
29
|
Nayak S, Blumenfeld NR, Laksanasopin T, Sia SK. Point-of-Care Diagnostics: Recent Developments in a Connected Age. Anal Chem 2017; 89:102-123. [PMID: 27958710 PMCID: PMC5793870 DOI: 10.1021/acs.analchem.6b04630] [Citation(s) in RCA: 292] [Impact Index Per Article: 41.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Samiksha Nayak
- Department of Biomedical Engineering, Columbia University, 351 Engineering Terrace, 1210 Amsterdam Avenue, New York, NY 10027, USA
| | - Nicole R. Blumenfeld
- Department of Biomedical Engineering, Columbia University, 351 Engineering Terrace, 1210 Amsterdam Avenue, New York, NY 10027, USA
| | - Tassaneewan Laksanasopin
- Biological Engineering Program, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, 126 Pracha Uthit Rd., Bang Mod, Thung Khru, Bangkok 10140, Thailand
| | - Samuel K. Sia
- Department of Biomedical Engineering, Columbia University, 351 Engineering Terrace, 1210 Amsterdam Avenue, New York, NY 10027, USA
| |
Collapse
|
30
|
Shah GH, Leider JP, Luo H, Kaur R. Interoperability of Information Systems Managed and Used by the Local Health Departments. JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE 2016; 22 Suppl 6, Public Health Informatics:S34-S43. [PMID: 27684616 PMCID: PMC5049946 DOI: 10.1097/phh.0000000000000436] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
BACKGROUND In the post-Affordable Care Act era marked by interorganizational collaborations and availability of large amounts of electronic data from other community partners, it is imperative to assess the interoperability of information systems used by the local health departments (LHDs). OBJECTIVES To describe the level of interoperability of LHD information systems and identify factors associated with lack of interoperability. DATA AND METHODS This mixed-methods research uses data from the 2015 Informatics Capacity and Needs Assessment Survey, with a target population of all LHDs in the United States. A representative sample of 650 LHDs was drawn using a stratified random sampling design. A total of 324 completed responses were received (50% response rate). Qualitative data were used from a key informant interview study of LHD informatics staff from across the United States. Qualitative data were independently coded by 2 researchers and analyzed thematically. Survey data were cleaned, bivariate comparisons were conducted, and a multivariable logistic regression was run to characterize factors associated with interoperability. RESULTS For 30% of LHDs, no systems were interoperable, and 38% of LHD respondents indicated some of the systems were interoperable. Significant determinants of interoperability included LHDs having leadership support (adjusted odds ratio [AOR] = 3.54), control of information technology budget allocation (AOR = 2.48), control of data systems (AOR = 2.31), having a strategic plan for information systems (AOR = 1.92), and existence of business process analysis and redesign (AOR = 1.49). CONCLUSION Interoperability of all systems may be an informatics goal, but only a small proportion of LHDs reported having interoperable systems, pointing to a substantial need among LHDs nationwide.
Collapse
Affiliation(s)
- Gulzar H. Shah
- Department of Health Policy and Management, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, Georgia (Drs Shah and Kaur); de Beaumont Foundation, Bethesda, Maryland (Dr Leider); and Department of Public Health, Brody School of Medicine, East Carolina University, North Carolina (Dr Luo)
| | - Jonathon P. Leider
- Department of Health Policy and Management, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, Georgia (Drs Shah and Kaur); de Beaumont Foundation, Bethesda, Maryland (Dr Leider); and Department of Public Health, Brody School of Medicine, East Carolina University, North Carolina (Dr Luo)
| | - Huabin Luo
- Department of Health Policy and Management, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, Georgia (Drs Shah and Kaur); de Beaumont Foundation, Bethesda, Maryland (Dr Leider); and Department of Public Health, Brody School of Medicine, East Carolina University, North Carolina (Dr Luo)
| | - Ravneet Kaur
- Department of Health Policy and Management, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro, Georgia (Drs Shah and Kaur); de Beaumont Foundation, Bethesda, Maryland (Dr Leider); and Department of Public Health, Brody School of Medicine, East Carolina University, North Carolina (Dr Luo)
| |
Collapse
|
31
|
Conway M, O'Connor D. Social Media, Big Data, and Mental Health: Current Advances and Ethical Implications. Curr Opin Psychol 2016; 9:77-82. [PMID: 27042689 DOI: 10.1016/j.copsyc.2016.01.004] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Mental health (including substance abuse) is the fifth greatest contributor to the global burden of disease, with an economic cost estimated to be US $2.5 trillion in 2010, and expected to double by 2030. Developing information systems to support and strengthen population-level mental health monitoring forms a core part of the World Health Organization's Comprehensive Action Plan 2013-2020. In this paper, we review recent work that utilizes social media "big data" in conjunction with associated technologies like natural language processing and machine learning to address pressing problems in population-level mental health surveillance and research, focusing both on technological advances and core ethical challenges.
Collapse
Affiliation(s)
- Mike Conway
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, Utah, United States
| | - Daniel O'Connor
- Wellcome Trust, Gibbs Building, 215 Euston Road, London NW1 2BE, United Kingdom
| |
Collapse
|
32
|
Li EY, Tung CY, Chang SH. The wisdom of crowds in action: Forecasting epidemic diseases with a web-based prediction market system. Int J Med Inform 2016; 92:35-43. [PMID: 27318069 DOI: 10.1016/j.ijmedinf.2016.04.014] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Revised: 03/08/2016] [Accepted: 04/26/2016] [Indexed: 10/21/2022]
Abstract
BACKGROUND The quest for an effective system capable of monitoring and predicting the trends of epidemic diseases is a critical issue for communities worldwide. With the prevalence of Internet access, more and more researchers today are using data from both search engines and social media to improve the prediction accuracy. In particular, a prediction market system (PMS) exploits the wisdom of crowds on the Internet to effectively accomplish relatively high accuracy. OBJECTIVE This study presents the architecture of a PMS and demonstrates the matching mechanism of logarithmic market scoring rules. The system was implemented to predict infectious diseases in Taiwan with the wisdom of crowds in order to improve the accuracy of epidemic forecasting. METHODS The PMS architecture contains three design components: database clusters, market engine, and Web applications. The system accumulated knowledge from 126 health professionals for 31 weeks to predict five disease indicators: the confirmed cases of dengue fever, the confirmed cases of severe and complicated influenza, the rate of enterovirus infections, the rate of influenza-like illnesses, and the confirmed cases of severe and complicated enterovirus infection. RESULTS Based on the winning ratio, the PMS predicts the trends of three out of five disease indicators more accurately than does the existing system that uses the five-year average values of historical data for the same weeks. In addition, the PMS with the matching mechanism of logarithmic market scoring rules is easy to understand for health professionals and applicable to predict all the five disease indicators. CONCLUSIONS The PMS architecture of this study affords organizations and individuals to implement it for various purposes in our society. The system can continuously update the data and improve prediction accuracy in monitoring and forecasting the trends of epidemic diseases. Future researchers could replicate and apply the PMS demonstrated in this study to more infectious diseases and wider geographical areas, especially the under-developed countries across Asia and Africa.
Collapse
Affiliation(s)
- Eldon Y Li
- Department of Management Information Systems, National Chengchi University, Taipei City 11605, Taiwan, ROC.
| | - Chen-Yuan Tung
- Graduate Institute of Development Studies, National Chengchi University, Taipei City 11605, Taiwan, ROC.
| | - Shu-Hsun Chang
- Department of Management Information Systems, National Chengchi University, Taipei City 11605, Taiwan, ROC.
| |
Collapse
|
33
|
Mikal J, Hurst S, Conway M. Ethical issues in using Twitter for population-level depression monitoring: a qualitative study. BMC Med Ethics 2016; 17:22. [PMID: 27080238 PMCID: PMC4832544 DOI: 10.1186/s12910-016-0105-5] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2015] [Accepted: 04/06/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Recently, significant research effort has focused on using Twitter (and other social media) to investigate mental health at the population-level. While there has been influential work in developing ethical guidelines for Internet discussion forum-based research in public health, there is currently limited work focused on addressing ethical problems in Twitter-based public health research, and less still that considers these issues from users' own perspectives. In this work, we aim to investigate public attitudes towards utilizing public domain Twitter data for population-level mental health monitoring using a qualitative methodology. METHODS The study explores user perspectives in a series of five, 2-h focus group interviews. Following a semi-structured protocol, 26 Twitter users with and without a diagnosed history of depression discussed general Twitter use, along with privacy expectations, and ethical issues in using social media for health monitoring, with a particular focus on mental health monitoring. Transcripts were then transcribed, redacted, and coded using a constant comparative approach. RESULTS While participants expressed a wide range of opinions, there was an overall trend towards a relatively positive view of using public domain Twitter data as a resource for population level mental health monitoring, provided that results are appropriately aggregated. Results are divided into five sections: (1) a profile of respondents' Twitter use patterns and use variability; (2) users' privacy expectations, including expectations regarding data reach and permanence; (3) attitudes towards social media based population-level health monitoring in general, and attitudes towards mental health monitoring in particular; (4) attitudes towards individual versus population-level health monitoring; and (5) users' own recommendations for the appropriate regulation of population-level mental health monitoring. CONCLUSIONS Focus group data reveal a wide range of attitudes towards the use of public-domain social media "big data" in population health research, from enthusiasm, through acceptance, to opposition. Study results highlight new perspectives in the discussion of ethical use of public data, particularly with respect to consent, privacy, and oversight.
Collapse
Affiliation(s)
- Jude Mikal
- />Minnesota Population Center, University of Minnesota, Twin Cities, 50 Willey Hall, 225 – 19th Avenue South, Minneapolis, MN 55455 USA
| | - Samantha Hurst
- />Department of Family Medicine & Public Health, University of California, San Diego, MTF 162E, 9500 Gilman Drive, La Jolla, CA USA
| | - Mike Conway
- />Department of Biomedical Informatics, University of Utah, Rm 2008, 421 Wakara Way, #140, Salt Lake City, UT USA
| |
Collapse
|
34
|
Lee D, Lee H, Choi M. Examining the Relationship Between Past Orientation and US Suicide Rates: An Analysis Using Big Data-Driven Google Search Queries. J Med Internet Res 2016; 18:e35. [PMID: 26868917 PMCID: PMC4768042 DOI: 10.2196/jmir.4981] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Revised: 10/11/2015] [Accepted: 12/11/2015] [Indexed: 11/15/2022] Open
Abstract
Background Internet search query data reflect the attitudes of the users, using which we can measure the past orientation to commit suicide. Examinations of past orientation often highlight certain predispositions of attitude, many of which can be suicide risk factors. Objective To investigate the relationship between past orientation and suicide rate by examining Google search queries. Methods We measured the past orientation using Google search query data by comparing the search volumes of the past year and those of the future year, across the 50 US states and the District of Columbia during the period from 2004 to 2012. We constructed a panel dataset with independent variables as control variables; we then undertook an analysis using multiple ordinary least squares regression and methods that leverage the Akaike information criterion and the Bayesian information criterion. Results It was found that past orientation had a positive relationship with the suicide rate (P≤.001) and that it improves the goodness-of-fit of the model regarding the suicide rate. Unemployment rate (P≤.001 in Models 3 and 4), Gini coefficient (P≤.001), and population growth rate (P≤.001) had a positive relationship with the suicide rate, whereas the gross state product (P≤.001) showed a negative relationship with the suicide rate. Conclusions We empirically identified the positive relationship between the suicide rate and past orientation, which was measured by big data-driven Google search query.
Collapse
Affiliation(s)
- Donghyun Lee
- Korea Advanced Institute of Science and Technology, Graduate School of Innovation and Technology Management, Daejeon, Republic Of Korea
| | | | | |
Collapse
|
35
|
Elachola H, Gozzer E, Zhuo J, Sow S, Kattan R, Mimesh S, Al-Tawfiq J, Al-Sultan M, Memish Z. Mass gatherings: A one-stop opportunity to complement global disease surveillance. ACTA ACUST UNITED AC 2016. [DOI: 10.4103/2468-6360.186487] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
36
|
Tabatabaei SM, Metanat M. Mass Gatherings and Infectious Diseases Epidemiology and Surveillance. ACTA ACUST UNITED AC 2015. [DOI: 10.17795/iji-22833] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
37
|
Nsoesie EO, Kluberg SA, Mekaru SR, Majumder MS, Khan K, Hay SI, Brownstein JS. New digital technologies for the surveillance of infectious diseases at mass gathering events. Clin Microbiol Infect 2015; 21:134-40. [PMID: 25636385 PMCID: PMC4332877 DOI: 10.1016/j.cmi.2014.12.017] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Revised: 12/18/2014] [Accepted: 12/19/2014] [Indexed: 11/17/2022]
Abstract
Outbreaks of infectious diseases at mass gatherings can strain the health system of the host region and pose a threat to local and global health. In addition to strengthening existing surveillance systems, most host nations also use novel technologies to assess disease risk and augment traditional surveillance approaches. We review novel approaches to disease surveillance using the Internet, mobile phone applications, and wireless sensor networks. These novel approaches to disease surveillance can result in prompt detection.
Collapse
Affiliation(s)
- E O Nsoesie
- Children's Hospital Informatics Program, Boston Children's Hospital, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA.
| | - S A Kluberg
- Children's Hospital Informatics Program, Boston Children's Hospital, MA, USA
| | - S R Mekaru
- Children's Hospital Informatics Program, Boston Children's Hospital, MA, USA
| | - M S Majumder
- Children's Hospital Informatics Program, Boston Children's Hospital, MA, USA; Engineering Systems Division, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - K Khan
- Li Ka Shing Knowledge Institute, St Michael's Hospital, Toronto, ON, Canada
| | - S I Hay
- Spatial Ecology and Epidemiology Group, Department of Zoology, University of Oxford, Oxford, UK; Fogarty International Center, National Institutes of Health, Bethesda, MD, USA
| | - J S Brownstein
- Children's Hospital Informatics Program, Boston Children's Hospital, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA; Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada
| |
Collapse
|
38
|
Conway M. Ethical issues in using Twitter for public health surveillance and research: developing a taxonomy of ethical concepts from the research literature. J Med Internet Res 2014; 16:e290. [PMID: 25533619 PMCID: PMC4285736 DOI: 10.2196/jmir.3617] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 10/28/2014] [Indexed: 11/17/2022] Open
Abstract
Background The rise of social media and microblogging platforms in recent years, in conjunction with the development of techniques for the processing and analysis of “big data”, has provided significant opportunities for public health surveillance using user-generated content. However, relatively little attention has been focused on developing ethically appropriate approaches to working with these new data sources. Objective Based on a review of the literature, this study seeks to develop a taxonomy of public health surveillance-related ethical concepts that emerge when using Twitter data, with a view to: (1) explicitly identifying a set of potential ethical issues and concerns that may arise when researchers work with Twitter data, and (2) providing a starting point for the formation of a set of best practices for public health surveillance through the development of an empirically derived taxonomy of ethical concepts. Methods We searched Medline, Compendex, PsycINFO, and the Philosopher’s Index using a set of keywords selected to identify Twitter-related research papers that reference ethical concepts. Our initial set of queries identified 342 references across the four bibliographic databases. We screened titles and abstracts of these references using our inclusion/exclusion criteria, eliminating duplicates and unavailable papers, until 49 references remained. We then read the full text of these 49 articles and discarded 36, resulting in a final inclusion set of 13 articles. Ethical concepts were then identified in each of these 13 articles. Finally, based on a close reading of the text, a taxonomy of ethical concepts was constructed based on ethical concepts discovered in the papers. Results From these 13 articles, we iteratively generated a taxonomy of ethical concepts consisting of 10 top level categories: privacy, informed consent, ethical theory, institutional review board (IRB)/regulation, traditional research vs Twitter research, geographical information, researcher lurking, economic value of personal information, medical exceptionalism, and benefit of identifying socially harmful medical conditions. Conclusions In summary, based on a review of the literature, we present a provisional taxonomy of public health surveillance-related ethical concepts that emerge when using Twitter data.
Collapse
Affiliation(s)
- Mike Conway
- University of California San Diego, Department of Family and Preventive Medicine, La Jolla, CA, United States.
| |
Collapse
|
39
|
Lee JL, DeCamp M, Dredze M, Chisolm MS, Berger ZD. What are health-related users tweeting? A qualitative content analysis of health-related users and their messages on twitter. J Med Internet Res 2014; 16:e237. [PMID: 25591063 PMCID: PMC4296104 DOI: 10.2196/jmir.3765] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Revised: 09/12/2014] [Accepted: 09/16/2014] [Indexed: 11/24/2022] Open
Abstract
Background Twitter is home to many health professionals who send messages about a variety of health-related topics. Amid concerns about physicians posting inappropriate content online, more in-depth knowledge about these messages is needed to understand health professionals’ behavior on Twitter. Objective Our goal was to characterize the content of Twitter messages, specifically focusing on health professionals and their tweets relating to health. Methods We performed an in-depth content analysis of 700 tweets. Qualitative content analysis was conducted on tweets by health users on Twitter. The primary objective was to describe the general type of content (ie, health-related versus non-health related) on Twitter authored by health professionals and further to describe health-related tweets on the basis of the type of statement made. Specific attention was given to whether a tweet was personal (as opposed to professional) or made a claim that users would expect to be supported by some level of medical evidence (ie, a “testable” claim). A secondary objective was to compare content types among different users, including patients, physicians, nurses, health care organizations, and others. Results Health-related users are posting a wide range of content on Twitter. Among health-related tweets, 53.2% (184/346) contained a testable claim. Of health-related tweets by providers, 17.6% (61/346) were personal in nature; 61% (59/96) made testable statements. While organizations and businesses use Twitter to promote their services and products, patient advocates are using this tool to share their personal experiences with health. Conclusions Twitter users in health-related fields tweet about both testable claims and personal experiences. Future work should assess the relationship between testable tweets and the actual level of evidence supporting them, including how Twitter users—especially patients—interpret the content of tweets posted by health providers.
Collapse
Affiliation(s)
- Joy L Lee
- Johns Hopkins Bloomberg School of Public Health, Department of Health Policy & Management, Baltimore, MD, United States.
| | | | | | | | | |
Collapse
|
40
|
Mao C, Wu XY, Fu XH, Di MY, Yu YY, Yuan JQ, Yang ZY, Tang JL. An internet-based epidemiological investigation of the outbreak of H7N9 Avian influenza A in China since early 2013. J Med Internet Res 2014; 16:e221. [PMID: 25257217 PMCID: PMC4211021 DOI: 10.2196/jmir.3763] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2014] [Accepted: 09/06/2014] [Indexed: 11/30/2022] Open
Abstract
Background In early 2013, a new type of avian influenza, H7N9, emerged in China. It quickly became an issue of great public concern and a widely discussed topic on the Internet. A considerable volume of relevant information was made publicly available on the Internet through various sources. Objective This study aimed to describe the outbreak of H7N9 in China based on data openly available on the Internet and to validate our investigation by comparing our findings with a well-conducted conventional field epidemiologic study. Methods We searched publicly accessible Internet data on the H7N9 outbreak primarily from government and major mass media websites in China up to February 10, 2014. Two researchers independently extracted, compared, and confirmed the information of each confirmed H7N9 case using a self-designed data extraction form. We summarized the epidemiological and clinical characteristics of confirmed H7N9 cases and compared them with those from the field study. Results According to our data updated until February 10, 2014, 334 confirmed H7N9 cases were identified. The median age was 58 years and 67.0% (219/327) were males. Cases were reported in 15 regions in China. Five family clusters were found. Of the 16.8% (56/334) of the cases with relevant data, 69.6% (39/56) reported a history of exposure to animals. Of the 1751 persons with a close contact with a confirmed case, 0.6% (11/1751) of them developed respiratory symptoms during the 7-day surveillance period. In the 97.9% (327/334) of the cases with relevant data, 21.7% (71/327) died, 20.8% (68/327) were discharged from a hospital, and 57.5% (188/327) were of uncertain status. We compared our findings before February 10, 2014 and those before December 1, 2013 with those from the conventional field study, which had the latter cutoff date of ours in data collection. Our study showed most epidemiological and clinical characteristics were similar to those in the field study, except for case fatality (71/327, 21.7% for our data before February 10; 45/138, 32.6% for our data before December 1; 47/139, 33.8% for the field study), time from illness onset to first medical care (4 days, 3 days, and 1 day), and time from illness onset to death (16.5 days, 17 days, and 21 days). Conclusions Findings from our Internet-based investigation were similar to those from the conventional field study in most epidemiological and clinical aspects of the outbreak. Importantly, publicly available Internet data are open to any interested researchers and can thus greatly facilitate the investigation and control of such outbreaks. With improved efforts for Internet data provision, Internet-based investigation has a great potential to become a quick, economical, novel approach to investigating sudden issues of great public concern that involve a relatively small number of cases like this H7N9 outbreak.
Collapse
Affiliation(s)
- Chen Mao
- School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong, Hong Kong, China (Hong Kong)
| | | | | | | | | | | | | | | |
Collapse
|