1
|
Willem T, Wollek A, Cheslerean-Boghiu T, Kenney M, Buyx A. The Social Construction of Categorical Data: Mixed Methods Approach to Assessing Data Features in Publicly Available Datasets. JMIR Med Inform 2025; 13:e59452. [PMID: 39874567 DOI: 10.2196/59452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 11/14/2024] [Accepted: 11/17/2024] [Indexed: 01/30/2025] Open
Abstract
BACKGROUND In data-sparse areas such as health care, computer scientists aim to leverage as much available information as possible to increase the accuracy of their machine learning models' outputs. As a standard, categorical data, such as patients' gender, socioeconomic status, or skin color, are used to train models in fusion with other data types, such as medical images and text-based medical information. However, the effects of including categorical data features for model training in such data-scarce areas are underexamined, particularly regarding models intended to serve individuals equitably in a diverse population. OBJECTIVE This study aimed to explore categorical data's effects on machine learning model outputs, rooted the effects in the data collection and dataset publication processes, and proposed a mixed methods approach to examining datasets' data categories before using them for machine learning training. METHODS Against the theoretical background of the social construction of categories, we suggest a mixed methods approach to assess categorical data's utility for machine learning model training. As an example, we applied our approach to a Brazilian dermatological dataset (Dermatological and Surgical Assistance Program at the Federal University of Espírito Santo [PAD-UFES] 20). We first present an exploratory, quantitative study that assesses the effects when including or excluding each of the unique categorical data features of the PAD-UFES 20 dataset for training a transformer-based model using a data fusion algorithm. We then pair our quantitative analysis with a qualitative examination of the data categories based on interviews with the dataset authors. RESULTS Our quantitative study suggests scattered effects of including categorical data for machine learning model training across predictive classes. Our qualitative analysis gives insights into how the categorical data were collected and why they were published, explaining some of the quantitative effects that we observed. Our findings highlight the social constructedness of categorical data in publicly available datasets, meaning that the data in a category heavily depend on both how these categories are defined by the dataset creators and the sociomedico context in which the data are collected. This reveals relevant limitations of using publicly available datasets in contexts different from those of the collection of their data. CONCLUSIONS We caution against using data features of publicly available datasets without reflection on the social construction and context dependency of their categorical data features, particularly in data-sparse areas. We conclude that social scientific, context-dependent analysis of available data features using both quantitative and qualitative methods is helpful in judging the utility of categorical data for the population for which a model is intended.
Collapse
Affiliation(s)
- Theresa Willem
- Institute of History and Ethics in Medicine, School of Medicine and Health, Technical University of Munich, Munich, Germany
- Helmholtz AI, Helmholtz Munich, Munich, Germany
| | - Alessandro Wollek
- Munich Institute of Biomedical Engineering, School of Computation, Information, and Technology, Technical University of Munich, Munich, Germany
| | - Theodor Cheslerean-Boghiu
- Munich Institute of Biomedical Engineering, School of Computation, Information, and Technology, Technical University of Munich, Munich, Germany
| | - Martha Kenney
- Women & Gender Studies, San Francisco State University, San Francisco, CA, United States
| | - Alena Buyx
- Institute of History and Ethics in Medicine, School of Medicine and Health, Technical University of Munich, Munich, Germany
| |
Collapse
|
2
|
Murray A, Browe D, Darling KW, Reardon J. Cells and the city: The rise and fall of urban biopolitics in San Francisco, 1970-2020. SOCIAL STUDIES OF SCIENCE 2024; 54:805-835. [PMID: 39041392 PMCID: PMC11588566 DOI: 10.1177/03063127241261376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/24/2024]
Abstract
STS theories of biocapital conceptualize how biomedical knowledge and capital form together. Though these formations of biocapital often are located in large urban centers, few scholars have attended to how they are transforming urban spaces and places. In this paper we argue that the twinned technological development of cells and cities concentrates economic and symbolic capital and sets in motion contentious practices we name urban biopolitics. We draw on archival research and a nearly decade-long ethnography of the expansion of biomedical campuses in a major American city to show how the speculative logics of land development and biomedical innovation become bound together in a process we describe as speculative revitalization. We examine how the logics of speculative revitalization imagine a future in which cities and biomedicine produce wealth and health harmoniously together. However, in practice-as buildings of new biomedical urban campuses get built-the dreams of billionaire philanthrocapitalists to create global cities clash with the plans of biomedical researchers to create global health. We document the reproduction of stratified and racialized biomedical exclusions that result while also highlighting the unlikely opportunities for creating alliances committed to creating equitable biomedical research and healthcare in urban communities.
Collapse
Affiliation(s)
- Andy Murray
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Dennis Browe
- University of California, Santa Cruz, Santa Cruz, CA, USA
| | | | - Jenny Reardon
- University of California, Santa Cruz, Santa Cruz, CA, USA
| |
Collapse
|
3
|
Pot M, Spalletta O, Green S. Precision medicine in primary care: How GPs envision "old" and "new" forms of personalization. Soc Sci Med 2024; 358:117259. [PMID: 39181083 DOI: 10.1016/j.socscimed.2024.117259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 07/13/2024] [Accepted: 08/15/2024] [Indexed: 08/27/2024]
Abstract
Visions of precision or personalized medicine (PM) are gaining currency around the globe. While the potential of PM in specialist medicine has been in focus, primary care is also considered to be a fruitful area for the application of PM. "Low-tech" forms of personalization and attention to individual patients are already central features of primary care practice, and primary care thus constitutes an area in which "old" and "new" forms of personalization (may) come together. Against this backdrop, we explore general practitioners' (GPs) views on PM and how they envision the future of personalization in primary care. We draw on 45 qualitative interviews with GPs from Austria, Denmark, and the United States. Along the lines of major "promises" of PM-tailoring treatment decisions, improving disease prevention, empowering patients-we show that in some areas GPs consider PM to be a continuation or extension of existing practices of personalization, while in other cases, GPs envision that PM may negatively disrupt current forms of personalization in primary care. We suggest that this ambivalent stance towards PM can be understood through the lens of GPs' views on core values and practices of primary care.
Collapse
Affiliation(s)
- Mirjam Pot
- University of Vienna, Department of Political Science, Austria; European Centre for Social Welfare Policy and Research, Austria.
| | - Olivia Spalletta
- University of Copenhagen, Department of Science Education, Section for History and Philosophy of Science, Denmark; University of Copenhagen, Department of Public Health, Centre for Medical Science and Technology Studies, Denmark
| | - Sara Green
- University of Copenhagen, Department of Science Education, Section for History and Philosophy of Science, Denmark; University of Copenhagen, Department of Public Health, Centre for Medical Science and Technology Studies, Denmark
| |
Collapse
|
4
|
Aparicio A. Missing the "We" in Precision Medicine. THE AMERICAN JOURNAL OF BIOETHICS : AJOB 2024; 24:96-98. [PMID: 38394008 DOI: 10.1080/15265161.2024.2303140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2024]
|
5
|
Talias MA, Lamnisos D, Heraclides A. Editorial: Data science and health economics in precision public health. Front Public Health 2022; 10:960282. [PMID: 36561876 PMCID: PMC9765307 DOI: 10.3389/fpubh.2022.960282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 09/20/2022] [Indexed: 12/12/2022] Open
Affiliation(s)
- Michael A. Talias
- Healthcare Management Postgraduate Program, School of Economics and Management, Open University of Cyprus, Latsia, Cyprus,*Correspondence: Michael A. Talias
| | - Demetris Lamnisos
- Department of Health Sciences, European University Cyprus, Engomi, Cyprus
| | | |
Collapse
|
6
|
Wesson P, Hswen Y, Valdes G, Stojanovski K, Handley MA. Risks and Opportunities to Ensure Equity in the Application of Big Data Research in Public Health. Annu Rev Public Health 2022; 43:59-78. [PMID: 34871504 PMCID: PMC8983486 DOI: 10.1146/annurev-publhealth-051920-110928] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The big data revolution presents an exciting frontier to expand public health research, broadening the scope of research and increasing the precision of answers. Despite these advances, scientists must be vigilant against also advancing potential harms toward marginalized communities. In this review, we provide examples in which big data applications have (unintentionally) perpetuated discriminatory practices, while also highlighting opportunities for big data applications to advance equity in public health. Here, big data is framed in the context of the five Vs (volume, velocity, veracity, variety, and value), and we propose a sixth V, virtuosity, which incorporates equity and justice frameworks. Analytic approaches to improving equity are presented using social computational big data, fairness in machine learning algorithms, medical claims data, and data augmentation as illustrations. Throughout, we emphasize the biasing influence of data absenteeism and positionality and conclude with recommendations for incorporating an equity lens into big data research.
Collapse
Affiliation(s)
- Paul Wesson
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA;
- Bakar Computational Health Sciences Institute, University of California, San Francisco, California, USA
| | - Yulin Hswen
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA;
- Bakar Computational Health Sciences Institute, University of California, San Francisco, California, USA
| | - Gilmer Valdes
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA;
- Department of Radiation Oncology, University of California, San Francisco, California, USA
| | - Kristefer Stojanovski
- Department of Health Behavior and Health Education, School of Public Health, University of Michigan, Ann Arbor, Michigan, USA
- Department of Social, Behavioral and Population Sciences, School of Public Health and Tropical Medicine, Tulane University, New Orleans, Louisiana, USA
| | - Margaret A Handley
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA;
- Department of Medicine, University of California, San Francisco, California, USA
- Zuckerberg San Francisco General Hospital and Trauma Center, San Francisco, California, USA
- Partnerships for Research in Implementation Science for Equity (PRISE), University of California, San Francisco, California, USA
| |
Collapse
|
7
|
Johnson WG. Using Precision Public Health to Manage Climate Change: Opportunities, Challenges, and Health Justice. THE JOURNAL OF LAW, MEDICINE & ETHICS : A JOURNAL OF THE AMERICAN SOCIETY OF LAW, MEDICINE & ETHICS 2020; 48:681-693. [PMID: 33404333 DOI: 10.1177/1073110520979374] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Amid public health concerns over climate change, "precision public health" (PPH) is emerging in next generation approaches to practice. These novel methods promise to augment public health operations by using ever larger and more robust health datasets combined with new tools for collecting and analyzing data. Precision strategies to protecting the public health could more effectively or efficiently address the systemic threats of climate change, but may also propagate or exacerbate health disparities for the populations most vulnerable in a changing climate. How PPH interventions collect and aggregate data, decide what to measure, and analyze data pose potential issues around privacy, neglecting social determinants of health, and introducing algorithmic bias into climate responses. Adopting a health justice framework, guided by broader social and climate justice tenets, can reveal principles and policy actions which may guide more responsible implementation of PPH in climate responses.
Collapse
Affiliation(s)
- Walter G Johnson
- Walter G. Johnson, J.D. M.S.T.P., is a research fellow at the Sandra Day O'Connor College of Law, Arizona State University. He received a J.D. from the Sandra Day O'Connor College of Law in 2020 and a Master of Science and Technology Policy (M.S.T.P.) from Arizona State University in 2017
| |
Collapse
|
8
|
Abstract
OBJECTIVES This scoping review synthesizes the recent literature on precision public health and the influence of predictive models on health equity with the intent to highlight central concepts for each topic and identify research opportunities for the biomedical informatics community. METHODS Searches were conducted using PubMed for publications between 2017-01-01 and 2019-12-31. RESULTS Precision public health is defined as the use of data and evidence to tailor interventions to the characteristics of a single population. It differs from precision medicine in terms of its focus on populations and the limited role of human genomics. High-resolution spatial analysis in a global health context and application of genomics to infectious organisms are areas of progress. Opportunities for informatics research include (i) the development of frameworks for measuring non-clinical concepts, such as social position, (ii) the development of methods for learning from similar populations, and (iii) the evaluation of precision public health implementations. Just as the effects of interventions can differ across populations, predictive models can perform systematically differently across subpopulations due to information bias, sampling bias, random error, and the choice of the output. Algorithm developers, professional societies, and governments can take steps to prevent and mitigate these biases. However, even if the steps to avoid bias are clear in theory, they can be very challenging to accomplish in practice. CONCLUSIONS Both precision public health and predictive modelling require careful consideration in how subpopulations are defined and access to data on subpopulations can be challenging. While the theory for both topics has advanced considerably, there is much work to be done in understanding how to implement and evaluate these approaches in practice.
Collapse
|
9
|
Sariola S, Gilbert SF. Toward a Symbiotic Perspective on Public Health: Recognizing the Ambivalence of Microbes in the Anthropocene. Microorganisms 2020; 8:E746. [PMID: 32429344 PMCID: PMC7285259 DOI: 10.3390/microorganisms8050746] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Revised: 05/13/2020] [Accepted: 05/14/2020] [Indexed: 02/07/2023] Open
Abstract
Microbes evolve in complex environments that are often fashioned, in part, by human desires. In a global perspective, public health has played major roles in structuring how microbes are perceived, cultivated, and destroyed. The germ theory of disease cast microbes as enemies of the body and the body politic. Antibiotics have altered microbial development by providing stringent natural selection on bacterial species, and this has led to the formation of antibiotic-resistant bacterial strains. Public health perspectives such as "Precision Public Health" and "One Health" have recently been proposed to further manage microbial populations. However, neither of these take into account the symbiotic relationships that exist between bacterial species and between bacteria, viruses, and their eukaryotic hosts. We propose a perspective on public health that recognizes microbial evolution through symbiotic associations (the hologenome theory) and through lateral gene transfer. This perspective has the advantage of including both the pathogenic and beneficial interactions of humans with bacteria, as well as combining the outlook of the "One Health" model with the genomic methodologies utilized in the "Precision Public Health" model. In the Anthropocene, the conditions for microbial evolution have been altered by human interventions, and public health initiatives must recognize both the beneficial (indeed, necessary) interactions of microbes with their hosts as well as their pathogenic interactions.
Collapse
Affiliation(s)
- Salla Sariola
- Faculty of Social Sciences, Sociology, University of Helsinki, 00014 Helsinki, Finland;
| | - Scott F. Gilbert
- Department of Biology, Swarthmore College, Swarthmore, PA 19081, USA
| |
Collapse
|
10
|
Korzeniewski SJ, Bezold C, Carbone JT, Danagoulian S, Foster B, Misra D, El-Masri MM, Zhu D, Welch R, Meloche L, Hill AB, Levy P. The Population Health OutcomEs aNd Information EXchange (PHOENIX) Program - A Transformative Approach to Reduce the Burden of Chronic Disease. Online J Public Health Inform 2020; 12:e3. [PMID: 32577152 PMCID: PMC7295585 DOI: 10.5210/ojphi.v12i1.10456] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
This concept article introduces a transformative vision to reduce the population burden of chronic disease by focusing on data integration, analytics, implementation and community engagement. Known as PHOENIX (The Population Health OutcomEs aNd Information EXchange), the approach leverages a state level health information exchange and multiple other resources to facilitate the integration of clinical and social determinants of health data with a goal of achieving true population health monitoring and management. After reviewing historical context, we describe how multilevel and multimodal data can be used to facilitate core public health services, before discussing the controversies and challenges that lie ahead.
Collapse
|