1
|
Baminiwatte R, Torsu B, Scherbakov D, Mollalo A, Obeid JS, Alekseyenko AV, Lenert LA. Machine learning in healthcare citizen science: A scoping review. Int J Med Inform 2025; 195:105766. [PMID: 39740357 PMCID: PMC11810576 DOI: 10.1016/j.ijmedinf.2024.105766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 11/20/2024] [Accepted: 12/15/2024] [Indexed: 01/02/2025]
Abstract
OBJECTIVES This scoping review aims to clarify the definition and trajectory of citizen-led scientific research (so-called citizen science) within the healthcare domain, examine the degree of integration of machine learning (ML) and the participation levels of citizen scientists in health-related projects. MATERIALS AND METHODS In January and September 2024 we conducted a comprehensive search in PubMed, Scopus, Web of Science, and EBSCOhost platform for peer-reviewed publications that combine citizen science and machine learning (ML) in healthcare. Articles were excluded if citizens were merely passive data providers or if only professional scientists were involved. RESULTS Out of an initial 1,395 screened, 56 articles spanning from 2013 to 2024 met the inclusion criteria. The majority of research projects were conducted in the U.S. (n = 20, 35.7 %), followed by Germany (n = 6, 10.7 %), with Spain, Canada, and the UK each contributing three studies (5.4 %). Data collection was the primary form of citizen scientist involvement (n = 29, 51.8 %), which included capturing images, sharing data online, and mailing samples. Data annotation was the next most common activity (n = 15, 26.8 %), followed by participation in ML model challenges (n = 8, 14.3 %) and decision-making contributions (n = 3, 5.4 %). Mosquitoes (n = 10, 34.5 %) and air pollution samples (n = 7, 24.2 %) were the main data objects collected by citizens for ML analysis. Classification tasks were the most prevalent ML method (n = 30, 52.6 %), with Convolutional Neural Networks being the most frequently used algorithm (n = 13, 20 %). DISCUSSION AND CONCLUSIONS Citizen science in healthcare is currently an American and European construct with growing expansion in Asia. Citizens are contributing data, and labeling data for ML methods, but only infrequently analyzing or leading studies. Projects that use "crowd-sourced" data and "citizen science" should be differentiated depending on the degree of involvement of citizens.
Collapse
Affiliation(s)
- Ranga Baminiwatte
- Biomedical Informatics Center, Department of Public Health Sciences, Medical University of South Carolina (MUSC), Charleston, SC 29425, USA
| | - Blessing Torsu
- Biomedical Informatics Center, Department of Public Health Sciences, Medical University of South Carolina (MUSC), Charleston, SC 29425, USA
| | - Dmitry Scherbakov
- Biomedical Informatics Center, Department of Public Health Sciences, Medical University of South Carolina (MUSC), Charleston, SC 29425, USA
| | - Abolfazl Mollalo
- Biomedical Informatics Center, Department of Public Health Sciences, Medical University of South Carolina (MUSC), Charleston, SC 29425, USA
| | - Jihad S Obeid
- Biomedical Informatics Center, Department of Public Health Sciences, Medical University of South Carolina (MUSC), Charleston, SC 29425, USA
| | - Alexander V Alekseyenko
- Biomedical Informatics Center, Department of Public Health Sciences, Medical University of South Carolina (MUSC), Charleston, SC 29425, USA
| | - Leslie A Lenert
- Biomedical Informatics Center, Department of Public Health Sciences, Medical University of South Carolina (MUSC), Charleston, SC 29425, USA.
| |
Collapse
|
2
|
Bolormaa E, Kim T, Gwak E, Choe SA, Martin Hilber A. Neighbourhood environment and early menarche among adolescent girls of five countries. EUR J CONTRACEP REPR 2024; 29:263-268. [PMID: 39166721 DOI: 10.1080/13625187.2024.2387648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 07/15/2024] [Accepted: 07/29/2024] [Indexed: 08/23/2024]
Abstract
INTRODUCTION We aim to investigate the relationship between individuals' perceptions of their neighbourhood environment and early menarche. METHODS This was a retrospective cohort study of 7,486 girls of Ethiopia, India, South Korea, the United Kingdom (UK), and the United States (US), born in 1997-2011 was analysed. Early menarche was defined as being below the 10th to 20th percentiles in each cohort, considering the varying distributions across countries. Perceived neighbourhood environments were assessed based on the responses for neighbourhood pollution, safety, and recreational facilities. We calculated the relative risk (RR) of early menarche for unfavourable environment. RESULTS The mean age at menarche was lowest in South Korea (10.6 years) and highest in Ethiopia (13.7 years). Unfavourable environment was associated with higher risk of early menarche overall (RR = 1.34, 95% confidence interval [CI]:1.09-1.65) and each country (3.03, 95% CI: 1.15-7.96 in Ethiopia; 1.99, 95% CI: 0.97-4.10 in India, 1.23, 95% CI: 0.67-2.27 in Korea; 1.26, 95% CI: 0.96-1.64 in the UK). Specifically, pollution (1.29, 95% CI: 1.03-1.62) and low safety (1.19, 95% CI: 1.60-1.88) were associated with early menarche. CONCLUSIONS Our finding highlights the potential role of perceived neighbourhood environment in the timing of puberty.
Collapse
Affiliation(s)
| | - Taemi Kim
- Department of Public Health, Korea University, Seoul, South Korea
| | - Eunson Gwak
- Department of Preventive Medicine, Korea University College of Medicine, Korea University, Seoul, South Korea
| | - Seung-Ah Choe
- Department of Preventive Medicine, Korea University College of Medicine, Korea University, Seoul, South Korea
- Research and Management Center for Health Risk of Particulate Matter, Seoul, Republic of Korea
| | - Adriane Martin Hilber
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland
- University of Basel, Basel, Switzerland
| |
Collapse
|
3
|
Dahu BM, Martinez-Villar CI, Toubal IE, Alshehri M, Ouadou A, Khan S, Sheets LR, Scott GJ. Application of Machine Learning and Deep Neural Visual Features for Predicting Adult Obesity Prevalence in Missouri. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2024; 21:1534. [PMID: 39595801 PMCID: PMC11594122 DOI: 10.3390/ijerph21111534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Revised: 11/13/2024] [Accepted: 11/15/2024] [Indexed: 11/28/2024]
Abstract
This research study investigates and predicts the obesity prevalence in Missouri, utilizing deep neural visual features extracted from medium-resolution satellite imagery (Sentinel-2). By applying a deep convolutional neural network (DCNN), the study aims to predict the obesity rate of census tracts based on visual features in the satellite imagery that covers each tract. The study utilizes Sentinel-2 satellite images, processed using the ResNet-50 DCNN, to extract deep neural visual features (DNVF). Obesity prevalence data, sourced from the CDC's 2022 estimates, is analyzed at the census tract level. The datasets were integrated to apply a machine learning model to predict the obesity rates in 1052 different census tracts in Missouri. The analysis reveals significant associations between DNVF and obesity prevalence. The predictive models show moderate success in estimating and predicting obesity rates in various census tracts within Missouri. The study emphasizes the potential of using satellite imagery and advanced machine learning in public health research. It points to environmental factors as significant determinants of obesity, suggesting the need for targeted health interventions. Employing DNVF to explore and predict obesity rates offers valuable insights for public health strategies and calls for expanded research in diverse geographical contexts.
Collapse
Affiliation(s)
- Butros M. Dahu
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA; (S.K.); (L.R.S.); (G.J.S.)
- Department of Health Management and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Carlos I. Martinez-Villar
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA; (C.I.M.-V.); (I.E.T.); (M.A.); (A.O.)
| | - Imad Eddine Toubal
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA; (C.I.M.-V.); (I.E.T.); (M.A.); (A.O.)
| | - Mariam Alshehri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA; (C.I.M.-V.); (I.E.T.); (M.A.); (A.O.)
| | - Anes Ouadou
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA; (C.I.M.-V.); (I.E.T.); (M.A.); (A.O.)
| | - Solaiman Khan
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA; (S.K.); (L.R.S.); (G.J.S.)
| | - Lincoln R. Sheets
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA; (S.K.); (L.R.S.); (G.J.S.)
- Department of Health Management and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Grant J. Scott
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA; (S.K.); (L.R.S.); (G.J.S.)
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA; (C.I.M.-V.); (I.E.T.); (M.A.); (A.O.)
| |
Collapse
|
4
|
Zhang T, Wang L, Zhang Y, Hu Y, Zhang W. Assessing the nonlinear impact of green space exposure on psychological stress perception using machine learning and street view images. Front Public Health 2024; 12:1402536. [PMID: 39360258 PMCID: PMC11445142 DOI: 10.3389/fpubh.2024.1402536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Accepted: 08/13/2024] [Indexed: 10/04/2024] Open
Abstract
Introduction Urban green space (GS) exposure is recognized as a nature-based strategy for addressing urban challenges. However, the stress relieving effects and mechanisms of GS exposure are yet to be fully explored. The development of machine learning and street view images offers a method for large-scale measurement and precise empirical analysis. Methods This study focuses on the central area of Shanghai, examining the complex effects of GS exposure on psychological stress perception. By constructing a multidimensional psychological stress perception scale and integrating machine learning algorithms with extensive street view images data, we successfully developed a framework for measuring urban stress perception. Using the scores from the psychological stress perception scale provided by volunteers as labeled data, we predicted the psychological stress perception in Shanghai's central urban area through the Support Vector Machine (SVM) algorithm. Additionally, this study employed the interpretable machine learning model eXtreme Gradient Boosting (XGBoost) algorithm to reveal the nonlinear relationship between GS exposure and residents' psychological stress. Results Results indicate that the GS exposure in central Shanghai is generally low, with significant spatial heterogeneity. GS exposure has a positive impact on reducing residents' psychological stress. However, this effect has a threshold; when GS exposure exceeds 0.35, its impact on stress perception gradually diminishes. Discussion We recommend combining the threshold of stress perception with GS exposure to identify urban spaces, thereby guiding precise strategies for enhancing GS. This research not only demonstrates the complex mitigating effect of GS exposure on psychological stress perception but also emphasizes the importance of considering the "dose-effect" of it in urban planning and construction. Based on open-source data, the framework and methods developed in this study have the potential to be applied in different urban environments, thus providing more comprehensive support for future urban planning.
Collapse
Affiliation(s)
- Tianlin Zhang
- School of Architecture, Tianjin University, Tianjin, China
| | - Lei Wang
- School of Architecture, Tianjin University, Tianjin, China
| | - Yazhuo Zhang
- School of Civil Engineering, Tianjin University, Tianjin, China
| | - Yike Hu
- School of Architecture, Tianjin University, Tianjin, China
| | - Wenzheng Zhang
- School of Architecture, Tianjin University, Tianjin, China
| |
Collapse
|
5
|
Nguyen QC, Tasdizen T, Alirezaei M, Mane H, Yue X, Merchant JS, Yu W, Drew L, Li D, Nguyen TT. Neighborhood built environment, obesity, and diabetes: A Utah siblings study. SSM Popul Health 2024; 26:101670. [PMID: 38708409 PMCID: PMC11068633 DOI: 10.1016/j.ssmph.2024.101670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 04/02/2024] [Accepted: 04/05/2024] [Indexed: 05/07/2024] Open
Abstract
Background This study utilizes innovative computer vision methods alongside Google Street View images to characterize neighborhood built environments across Utah. Methods Convolutional Neural Networks were used to create indicators of street greenness, crosswalks, and building type on 1.4 million Google Street View images. The demographic and medical profiles of Utah residents came from the Utah Population Database (UPDB). We implemented hierarchical linear models with individuals nested within zip codes to estimate associations between neighborhood built environment features and individual-level obesity and diabetes, controlling for individual- and zip code-level characteristics (n = 1,899,175 adults living in Utah in 2015). Sibling random effects models were implemented to account for shared family attributes among siblings (n = 972,150) and twins (n = 14,122). Results Consistent with prior neighborhood research, the variance partition coefficients (VPC) of our unadjusted models nesting individuals within zip codes were relatively small (0.5%-5.3%), except for HbA1c (VPC = 23%), suggesting a small percentage of the outcome variance is at the zip code-level. However, proportional change in variance (PCV) attributable to zip codes after the inclusion of neighborhood built environment variables and covariates ranged between 11% and 67%, suggesting that these characteristics account for a substantial portion of the zip code-level effects. Non-single-family homes (indicator of mixed land use), sidewalks (indicator of walkability), and green streets (indicator of neighborhood aesthetics) were associated with reduced diabetes and obesity. Zip codes in the third tertile for non-single-family homes were associated with a 15% reduction (PR: 0.85; 95% CI: 0.79, 0.91) in obesity and a 20% reduction (PR: 0.80; 95% CI: 0.70, 0.91) in diabetes. This tertile was also associated with a BMI reduction of -0.68 kg/m2 (95% CI: -0.95, -0.40). Conclusion We observe associations between neighborhood characteristics and chronic diseases, accounting for biological, social, and cultural factors shared among siblings in this large population-based study.
Collapse
Affiliation(s)
- Quynh C. Nguyen
- Department of Epidemiology and Biostatistics, University of Maryland School of Public Health, College Park, MD, United States
| | - Tolga Tasdizen
- Department of Electrical and Computer Engineering, Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, UT, United States
| | - Mitra Alirezaei
- Department of Electrical and Computer Engineering, Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, UT, United States
| | - Heran Mane
- Department of Epidemiology and Biostatistics, University of Maryland School of Public Health, College Park, MD, United States
| | - Xiaohe Yue
- Department of Epidemiology and Biostatistics, University of Maryland School of Public Health, College Park, MD, United States
| | - Junaid S. Merchant
- Department of Epidemiology and Biostatistics, University of Maryland School of Public Health, College Park, MD, United States
| | - Weijun Yu
- Department of Epidemiology and Biostatistics, University of Maryland School of Public Health, College Park, MD, United States
| | - Laura Drew
- Department of Epidemiology and Biostatistics, University of Maryland School of Public Health, College Park, MD, United States
| | - Dapeng Li
- Department of Geography and the Environment, University of Alabama, Tuscaloosa, AL, United States
| | - Thu T. Nguyen
- Department of Epidemiology and Biostatistics, University of Maryland School of Public Health, College Park, MD, United States
| |
Collapse
|
6
|
Wedyan M, Saeidi-Rizi F. Assessing the Impact of Urban Environments on Mental Health and Perception Using Deep Learning: A Review and Text Mining Analysis. J Urban Health 2024; 101:327-343. [PMID: 38466494 PMCID: PMC11052760 DOI: 10.1007/s11524-024-00830-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/17/2024] [Indexed: 03/13/2024]
Abstract
Understanding how outdoor environments affect mental health outcomes is vital in today's fast-paced and urbanized society. Recently, advancements in data-gathering technologies and deep learning have facilitated the study of the relationship between the outdoor environment and human perception. In a systematic review, we investigate how deep learning techniques can shed light on a better understanding of the influence of outdoor environments on human perceptions and emotions, with an emphasis on mental health outcomes. We have systematically reviewed 40 articles published in SCOPUS and the Web of Science databases which were the published papers between 2016 and 2023. The study presents and utilizes a novel topic modeling method to identify coherent keywords. By extracting the top words of each research topic, and identifying the current topics, we indicate that current studies are classified into three areas. The first topic was "Urban Perception and Environmental Factors" where the studies aimed to evaluate perceptions and mental health outcomes. Within this topic, the studies were divided based on human emotions, mood, stress, and urban features impacts. The second topic was titled "Data Analysis and Urban Imagery in Modeling" which focused on refining deep learning techniques, data collection methods, and participants' variability to understand human perceptions more accurately. The last topic was named "Greenery and visual exposure in urban spaces" which focused on the impact of the amount and the exposure of green features on mental health and perceptions. Upon reviewing the papers, this study provides a guide for subsequent research to enhance the view of using deep learning techniques to understand how urban environments influence mental health. It also provides various suggestions that should be taken into account when planning outdoor spaces.
Collapse
Affiliation(s)
- Musab Wedyan
- School of Planning, Design and Construction, Michigan State University, East Lansing, MI, USA
| | - Fatemeh Saeidi-Rizi
- School of Planning, Design and Construction, Michigan State University, East Lansing, MI, USA.
| |
Collapse
|
7
|
Hamim OF, Ukkusuri SV. Towards safer streets: A framework for unveiling pedestrians' perceived road safety using street view imagery. ACCIDENT; ANALYSIS AND PREVENTION 2024; 195:107400. [PMID: 38029553 DOI: 10.1016/j.aap.2023.107400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 10/24/2023] [Accepted: 11/21/2023] [Indexed: 12/01/2023]
Abstract
Road safety has become a global concern but its impact in low- and middle-income countries is widespread mainly due to lack of appropriate crash database system and under-reporting. In this context, the primary objective of this paper is to provide a scalable framework for unveiling pedestrians' perceived road safety that can also be applied in regions where accessible crash data are limited or near-crashes are left unreported. In the first step of our methodology, a deep learning architecture-based semantic segmentation model (HRNet+OCR) is trained using labeled Google Street View (GSV) images from specific study areas in Dhaka, Bangladesh, which facilitates the identification of both man-made components (such as roads, sidewalks, buildings, and vehicles) and natural elements (including trees and sky). The developed model showed excellent performance in identifying different features in an image by achieving high precision (0.95), recall (0.97), F1-score (0.96), and intersection over union (IoU) (91.86). Secondly, a group of trained raters scored the perceived road safety on an ordinal scale from 0 to 10 (extremely unsafe to extremely safe to walk in terms of road crashes) by assessing the GSV images. Then, several regression models have been used on features extracted from GSV images, and socio-demographic factors (i.e., population density, and relative wealth index) to estimate the perceived road safety, and random forest regression model was found to perform the best. Further, Shapley Additive Explanations (SHAP), a model-agnostic technique has been used for examining feature importance by computing the contribution of each feature to the random forest regression model output. The results show that sidewalk, road, population density, wall, and relative wealth index have higher impact on determining the perceived road safety rating. Additionally, the results of t-tests between the average perceived road safety scores for crash-prone and non crash-prone areas revealed the existence of significant differences. This study also provides perceived road safety rating map on a neighborhood scale, which can be a useful visualization tool for policy-makers and practitioners to identify the road safety deficiencies at specific locations, and formulate appropriate and strategic countermeasures to improve pedestrians' road safety.
Collapse
Affiliation(s)
- Omar Faruqe Hamim
- Lyles School of Civil Engineering, Purdue University, West Lafayette, IN 47907, USA; Department of Civil Engineering, Bangladesh University of Engineering and Technology, Dhaka 1000, Bangladesh
| | - Satish V Ukkusuri
- Lyles School of Civil Engineering, Purdue University, West Lafayette, IN 47907, USA.
| |
Collapse
|