Estimating tweet infectivity from the first 50 retweets is the key to predicting whether a tweet will go viral, according to a new study published in PLOS ONE on April 17, 2019 by Li Weihua from Beihang University, China and colleagues.
As online social networks and media continue to grow, so has the importance of understanding how they influence our thoughts and opinions. In particular, being able to predict the spread of social contagions is considered a key goal for those social information networks. Although models developed in the field of infectious diseases have been used to describe the spread of ideas, studies have not used real data to estimate how infectious the information is. The authors of the present study used about one month of Twitter data--comprising over 12 million tweets and more than 1.5 million retweets--and estimated each tweet's infectivity based on the network dynamics of the first 50 retweets associated with it. Then, they incorporated the infectivity estimates into a model with a decay constant that captures the gradual decline in interest as online information ages.
Using real data and simulations, the authors tested the ability of the infectivity-based model to predict the virality of retweet cascades, and compared its performance to that of the standard community model, which incorporates other predictive factors--such as social reinforcement and trapping effects that act to keep tweet cascades within small communities of connected users. They found that for both real Twitter data and simulated data, the infectivity model performed better than the community model, indicating that infectivity is a larger driving force in determining whether a tweet goes viral. Combining the two models into a hybrid community-infectivity model yielded the most accurate predictions, highlighting the complexity of the interacting forces that determine the life and death of social network information.
The authors add: "We propose a simulation model using Twitter data to show that infectivity, which reflects the intrinsic interestingness of an information cascade, can substantively improve the predictability of viral cascades."
Citation: Li W, Cranmer SJ, Zheng Z, Mucha PJ (2019) Infectivity enhances prediction of viral cascades in Twitter. PLoS ONE 14(4): e0214453. https://doi.org/10.1371/journal.pone.0214453
Funding: W.L. was supported by EPSRC Early Career Fellowship in Digital Economy (Grant No. EP/N006062/1). Z.Z. was supported by Program of National Natural Science Foundation of China Grant No. 11871004, Fundamental Research of Civil Aircraft Grant No. MJ-F-2012-04. P.J.M. was supported by the Eunice Kennedy Shriver National Institute of Child Health & Human Development of the National Institutes of Health under Award Number R01HD075712. The content is solely the responsibility of the authors and does not necessarily represent the official views of any of the agencies supporting this work.
Competing Interests: The authors have declared that no competing interests exist.
In your coverage please use this URL to provide access to the freely available article in PLOS ONE: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0214453