Language use on Twitter reflects social structure and social disparities
Journal article, Peer reviewed
Published version
Date
2023Metadata
Show full item recordCollections
- Institutt for psykologi [3205]
- Publikasjoner fra CRIStin - NTNU [39204]
Abstract
Large-scale mental health assessments increasingly rely upon user-contributed social media data. It is widely known that mental health and well-being are affected by minority group membership and social disparity. But do these factors manifest in the language use of social media users? We elucidate this question using spatial lag regressions. We examined the county-level (N = 1069) associations of lexical indicators linked to well-being and mental health, notably depression (e.g., first-person singular pronouns, negative emotions) with markers of social disparity (e.g., the Area Deprivation Index–3) and ethnicity, using a sample of approximately 30 million content-coded tweets (U.S. county-level aggregation). Results confirmed most expected associations: County-level lexical indicators of depression are positively linked with county-level area disparity (e.g., economic hardship and inequity) and percentage of ethnic minority groups. Predictive validity checks show that lexical indicators are related to future health and mental health outcomes. Lexical indicators of depression and adjustment coded from tweets aggregated at the county level could play a crucial role in prioritizing public health campaigns, particularly in socially deprived counties. Language use on Twitter reflects social structure and social disparities