Mayor E, Bietti LM. Language use on Twitter reflects social structure and social disparities.
Heliyon 2024;
10:e23528. [PMID:
38293550 PMCID:
PMC10825303 DOI:
10.1016/j.heliyon.2023.e23528]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 11/24/2023] [Accepted: 12/05/2023] [Indexed: 02/01/2024] Open
Abstract
Large-scale mental health assessments increasingly rely upon user-contributed social media data. It is widely known that mental health and well-being are affected by minority group membership and social disparity. But do these factors manifest in the language use of social media users? We elucidate this question using spatial lag regressions. We examined the county-level (N = 1069) associations of lexical indicators linked to well-being and mental health, notably depression (e.g., first-person singular pronouns, negative emotions) with markers of social disparity (e.g., the Area Deprivation Index-3) and ethnicity, using a sample of approximately 30 million content-coded tweets (U.S. county-level aggregation). Results confirmed most expected associations: County-level lexical indicators of depression are positively linked with county-level area disparity (e.g., economic hardship and inequity) and percentage of ethnic minority groups. Predictive validity checks show that lexical indicators are related to future health and mental health outcomes. Lexical indicators of depression and adjustment coded from tweets aggregated at the county level could play a crucial role in prioritizing public health campaigns, particularly in socially deprived counties.
Collapse