Journal Home > Volume 2 , Issue 2

How do people talk about COVID-19 online? To address this question, we offer an unsupervised framework that allows us to examine Twitter framings of the pandemic. Our approach employs a network-based exploration of social media data to identify, categorize, and understand communication patterns about the novel coronavirus on Twitter. The simplest structure that emerges from our analysis is the distinction between the internal/personal, external/global, and generic threat framings of the pandemic. This structure replicates in different Twitter samples and is validated using the variation of information measure, reflecting the significance and stability of our findings. Such an exploratory study is useful for understanding the contours of the natural, non-random structure in this online space. We contend that this understanding of structure is necessary to address a host of causal, supervised, and related questions downstream.


menu
Abstract
Full text
Outline
About this article

Uncovering the Online Social Structure Surrounding COVID-19

Show Author's information Philip D. Waggoner1( )Robert Y. Shapiro2Samuel Frederick2Ming Gong3
Institute for Social and Economic Research and Policy, Columbia University, New York City, NY 10027, USA
Department of Political Science, Columbia University, New York City, NY 10027, USA
Department of Quantitative Methods in the Social Sciences, Columbia University, New York City, NY 10027, USA

Abstract

How do people talk about COVID-19 online? To address this question, we offer an unsupervised framework that allows us to examine Twitter framings of the pandemic. Our approach employs a network-based exploration of social media data to identify, categorize, and understand communication patterns about the novel coronavirus on Twitter. The simplest structure that emerges from our analysis is the distinction between the internal/personal, external/global, and generic threat framings of the pandemic. This structure replicates in different Twitter samples and is validated using the variation of information measure, reflecting the significance and stability of our findings. Such an exploratory study is useful for understanding the contours of the natural, non-random structure in this online space. We contend that this understanding of structure is necessary to address a host of causal, supervised, and related questions downstream.

Keywords: COVID-19, community detection, twitter, networks, exploratory data analysis

References(34)

1

S. R. Baker, N. Bloom, S. J. Davis, K. Kost, M. Sammon, and T. Viratyosin, The unprecedented stock market reaction to COVID-19, The Review of Asset Pricing Studies, vol. 10, no. 4, pp. 742–758, 2020.

2

T. Gonzalez, M. A. de la Rubia, K. P. Hincz, M. ComasLopez, L. Subirats, S. Fort, and G. M. Sacha, Influence of COVID-19 confinement on students’ performance in higher education, PLoS One, vol. 15, no. 10, p. e0239490, 2020.

3

D. Y. Li, H. Chaudhary, and Z. Zhang, Modeling spatiotemporal pattern of depressive symptoms caused by COVID-19 using social media data mining, Int. J. Environ. Res. Public Health, vol. 17, no. 14, p. 4988, 2020.

4

S. Flaxman, S. Mishra, A. Gandy, H. J. T. Unwin, T. A. Mellan, H. Coupland, C. Whittaker, H. Zhu, T. Berah, J. W. Eaton, et al., Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe, Nature, vol. 584, no. 7820, pp. 257–261, 2020.

5

M. A. Raifman and J. R. Raifman, Disparities in the population at risk of severe illness from COVID-19 by race/ethnicity and income, Am. J. Prev. Med., vol. 59, no. 1, pp. 137–139, 2020.

6

S. K. Gadarian, S. W. Goodman, and T. B. Pepinsky, Partisanship, health behavior, and policy attitudes in the early stages of the COVID-19 pandemic, PLoS One, vol. 16, no. 4, p. e0249596, 2021.

7

M. K. Gusmano, E. A. Miller, P. Nadash, and E. J. Simpson, Partisanship in initial state responses to the COVID-19 pandemic, World Med. Health Policy, vol. 12, no. 4, pp. 380–389, 2020.

8

R. Kouzy, J. Abi Jaoude, A. Kraitem, M. B. El Alam, B. Karam, E. Adib, J. Zarka, C. Traboulsi, E. W. Akl, and K. Baddour, Coronavirus goes viral: Quantifying the COVID- 19 misinformation epidemic on twitter, Cureus, vol. 12, no. 3, p. e7255, 2020.

9

H. Budhwani and R. Y. Sun, Creating COVID-19 stigma by referencing the novel coronavirus as the “Chinese virus” on twitter: Quantitative analysis of social media data, J. Med. Internet Res., vol. 22, no. 5, p. e19301, 2020.

10

M. O. Lwin, J. H. Lu, A. Sheldenkar, P. J. Schulz, W. Shin, R. Gupta, and Y. P. Yang, Global sentiments surrounding the COVID-19 pandemic on twitter: Analysis of twitter trends, JMIR Public Health Surveill., vol. 6, no. 2, p. e19447, 2020.

11

B. A. Panuganti, A. Jafari, B. MacDonald, and A. S. DeConde, Predicting COVID-19 incidence using anosmia and other COVID-19 symptomatology: Preliminary analysis using google and twitter, Otolaryngol. Head Neck Surg., vol. 163, no. 3, pp. 491–497, 2020.

12

D. Bisanzio, M. U. G. Kraemer, T. Brewer, J. S. Brownstein, and R. Reithinger, Geolocated twitter social media data to describe the geo-graphic spread of SARS-CoV-2, J. Travel Med., vol. 27, no. 5, p. taaa120, 2020.

13
S. Iyengar, Is Anyone Responsible? How Television Frames Political Issues. Chicago, IL, USA: University of Chicago Press, 1994.
14
S. Smith, Coronavirus (COVID-19) tweets – early April, https://www.kaggle.com/smid80/coronavirus-covid19-tweetsearly-april, 2020.
15
S. Smith, Coronavirus (COVID-19) tweets – late April, https://www.kaggle.com/smid80/coronavirus-covid19-tweets-late-april, 2020.
16
L. Derczynski, A. Ritter, S. Clark, and K. Bontcheva, Twitter part-of-speech tagging for all: Overcoming sparse and noisy data, in Recent Advances in Natural Language Processing, Hissar, Bulgaria, 2013, pp. 198–206.
17
T. Kwartler, Text Mining in Practice with R. Chichester, UK: John Wiley & Sons, 2017.https://doi.org/10.1002/9781119282105
DOI
18

A. Clauset, M. E. J. Newman, and C. Moore, Finding community structure in very large networks, Phys. Rev.E, vol. 70, no. 6, p. 066111, 2004.

19

V. D. Blondel, J. L. Guillaume, R. Lambiotte, and E. Lefebvre, Fast unfolding of communities in large networks, J. Stat. Mech.:Theory Exp., vol. 2008, no. 10, p. P10008, 2008.

20
P. Pons and M. Latapy, Computing communities in large networks using random walks, in Proc. 20th Int. Symp. Computer and Information Sciences, Istanbul, Turkey, 2005, pp. 284–293.https://doi.org/10.1007/11569596_31
DOI
21
P. D. Waggoner, Unsupervised Machine Learning for Clustering in Political and Social Research. Cambridge, UK: Cambridge University Press, 2020.https://doi.org/10.1017/9781108883955
DOI
22

B. Karrer, E. Levina, and M. E. J. Newman, Robustness of community structure in networks, Phys. Rev.E, vol. 77, no. 4, p. 046119, 2008.

23
V. Policastro, D. Righelli, A. Carissimo, L. Cutillo, and I. De Feis, ROBustness in network (Robin): An R package for comparison and validation of communities, arXiv preprint arXiv: 2102.03106, 2021.https://doi.org/10.32614/RJ-2021-040
DOI
24

B. Enjolras, K. Steen-Johnsen, and D. Wollebaek, Social media and mobilization to offline demonstrations: Transcending participatory divides? New Media Soc., vol. 15, no. 6, pp. 890–908, 2013.

25

S. Muralidharan, L. Rasmussen, D. Patterson, and J. H. Shin, Hope for Haiti: An analysis of facebook and twitter usage during the earthquake relief efforts, Public Relat. Rev., vol. 37, no. 2, pp. 175–177, 2011.

26

A. Ceron, L. Curini, S. M. Iacus, and G. Porro, Every tweet counts? How sentiment analysis of social media can improve our knowledge of citizens’ political preferences with an application to Italy and France New Media Soc., vol. 16, no. 2, pp. 340–358, 2014.

27

J. Clinton, J. Cohen, J. Lapinski, and M. Trussler, Partisan pandemic: How partisanship and public health concerns affect individuals’ social mobility during COVID-19, Sci. Adv., vol. 7, no. 2, p. eabd7204, 2021.

28

J. N. Druckman, S. Klar, Y. Krupnikov, M. Levendusky, and J. B. Ryan, Affective polarization, local contexts and public opinion in America,Nat. Hum. Behav., vol. 5, no. 1, pp. 28–38, 2021.

29

M. Meilă, Comparing clusterings—An information based distance, J. Multivariate Anal., vol. 98, no. 5, pp. 873–895, 2007.

30

A. Carissimo, L. Cutillo, and I. De Feis, Validation of community robustness, Comput. Stat. Data Anal., vol. 120, pp. 1–24, 2018.

31

G. K. Shahi, A. Dirkson, and T. A. Majchrzak, An exploratory study of COVID-19 misinformation on twitter, Online Soc. Netw. Media, vol. 22, p. 100104, 2021.

32
K. C. Yang, C. Torres-Lugo, and F. Menczer, Prevalence of low-credibility information on twitter during the COVID-19 outbreak, arXiv preprint arXiv: 2004.14484, 2020.
33
O. Tsur and A. Rappoport, What’s in a hashtag?: Content based prediction of the spread of ideas in microblogging communities, in Proc. 5th ACM Int. Conf. Web Search and Data Mining, Seattle, WA, USA, 2012, pp. 643–652.https://doi.org/10.1145/2124295.2124320
DOI
34

J. Y. Huang, Y. Y. Tang, Y. Hu, J. J. Li, and C. J. Hu, Predicting the active period of popularity evolution: A case study on twitter hashtags, Inf. Sci., vol. 512, pp. 315–326, 2020.

Publication history
Rights and permissions

Publication history

Received: 20 February 2021
Revised: 02 June 2021
Accepted: 26 June 2021
Published: 23 August 2021
Issue date: June 2021

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return