Unveiling Global Narratives: A Multilingual Twitter Dataset of News Media on the Russo-Ukrainian Conflict
Date:
Presented at International Conference on Multimedia Retrieval (ICMR) 2024
The ongoing Russo-Ukrainian conflict has been a subject of intense media coverage worldwide. Understanding the global narrativesurrounding this topic is crucial for researchers that aim to gaininsights into its multifaceted dimensions. In this paper, we presenta novel multimedia dataset that focuses on this topic by collectingand processing tweets posted by news or media companies on socialmedia across the globe. We collected tweets from February 2022 toMay 2023 to acquire approximately 1.5 million tweets in 60 different languages along with their images. Each entry in the dataset isaccompanied by processed tags, allowing for the identification ofentities, stances, textual or visual concepts, and sentiment. The availability of this multimedia dataset serves as a valuable resource forresearchers aiming to investigate the global narrative surrounding theongoing conflict from various aspects such as who are the prominententities involved, what stances are taken, where do these stancesoriginate from, how are the different textual and visual conceptsrelated to the event portrayed