In their study, “Credibility Ranking of Tweets during High Impact Events,” authors Aditi Gupta and Ponnurangam Kumaraguru “analyzed the credibility of information in tweets corresponding to fourteen high impact news events of 2011 around the globe.” According to their analysis, “30% of total tweets about an event contained situational information about the event while 14% was spam.” In addition, about 17% of total tweets contained situational awareness information that was credible.
The study analyzed over 35 million tweets posted by ~8 million users based on current trending topics. From this data, the authors identified 14 major events reflected in the tweets. These included the UK riots, Libya crisis, Virginia earthquake and Hurricane Irene, for example.
“Using regression analysis, we identied the important content and sourced based features, which can predict the credibility of information in a tweet. Prominent content based features were number of unique characters, swear words, pronouns, and emoticons in a tweet, and user based features like the number of followers and length of username. We adopted a supervised machine learning and relevance feedback approach using the above features, to rank tweets according to their credibility score. The performance of our ranking algorithm signicantly enhanced when we applied re-ranking strategy. Results show that extraction of credible information from Twitter can be automated with high confidence.”
The paper is available here (PDF). For more applied research on “information forensics,” please see this link.
See also:
- Analyzing Fake Content on Twitter During Boston Bombings [link]
- Predicting the Credibility of Disaster Tweets Automatically [link]
- Auto-Identifying Fake Images on Twitter During Disasters [link]
- How to Verify Crowdsourced Information from Social Media [link]
- Crowdsourcing Critical Thinking to Verify Social Media [link]
The most interesting question is what percentage was active disinformation
On a related note:
“Last year, seven and a half million emergency calls were made to the police in Britain. But fewer than a quarter of them turned out to be real emergencies, and many were pranks or fakes.”
Source: http://www.cbsnews.com/8301-504923_162-57449369/fake-911-calls-arent-cheap
Pingback: Predicting the Credibility of Disaster Tweets Automatically | iRevolution
Pingback: Analyzing the Veracity of Tweets during a Major Crisis | iRevolution
Pingback: Debating The Value of Tweets For Disaster Response (Intelligently) | iRevolution
Pingback: Tweet analyzer ranks trustworthy Tweets during emergencies | Tim Batchelder.com
Pingback: Tweeting is Believing? Analyzing Perceptions of Credibility on Twitter | iRevolution
Pingback: Humanitarianism in the Network Age: Groundbreaking Study | iRevolution
Pingback: Tweet analyzer ranks trustworthy Tweets during emergencies | Tim Batchelder.com
Pingback: Automatically Identifying Fake Images Shared on Twitter During Disasters | iRevolution
Pingback: Analyzing Fake Content on Twitter During Boston Marathon Bombings | iRevolution
Pingback: New Insights on How To Verify Social Media | iRevolution
Pingback: Got TweetCred? Use it To Automatically Identify Credible Tweets | iRevolution