Opportunities of COVID-19 (mis)information

Feb 23, 2021 by Thibault Debatty | 2619 views

https://cylab.be/blog/133/opportunities-of-covid-19-misinformation

The ongoing COVID-19 crisis is being discussed a lot on social media platforms. Researchers and social media platforms a like make use of the online conversation to increase their situational awareness about the continuously evolving situation. At the same time, foreign powers or special interest groups have also been observed of piggybacking the large scale discussion to spread fake news and/or misinformation.

In the context of our ongoing Social Media Intelligence (SOCMINT) project, the current situation does however create several opportunities:

Large-scale datasets are available. Twitter Developer Labs even created a specific COVID-19 stream endpoint: https://developer.twitter.com/en/docs/labs/covid19-stream/overview.

Given the global importance of the subject, multiple teams of researchers have actively engaged in manually labeling misinformation. This has lead to the existence of a multitude of annotated datasets in different languages (although most datasets are in English). The existence of multiple instances of a somewhat reliable ground truth is a luxury one rarely comes across in this domain:

CoAID COVID-19 healthcare misinformation dataset: https://github.com/cuilimeng/CoAID
FakeCovid dataset: https://gautamshahi.github.io/FakeCovid/
“Characterizing COVID-19 Misinformation Communities Using a Novel Twitter Dataset” provides a list of annotated COVID datasets: https://arxiv.org/abs/2008.00791

These datasets will be used to further test and evaluate ongoing research into different methods for automated misinformation detection. Furthermore, we examine whether the conclusions that can be drawn at a global level can also be applied specifically to Belgium.

Discover more on our SOCMINT project

This blog post is licensed under CC BY-SA 4.0 creative commons attribution share-alike