Russia-Ukraine War Social Media Posts
Social Media and Posts
Free
About
This dataset captures the daily discourse from the r/UkrainianConflict subreddit, focusing on news and events surrounding Russia's invasion of Ukraine. It comprises collected posts and comments, offering valuable insights into public sentiment and discussions regarding the ongoing conflict. The data is collected and merged daily using the praw Python package, with content created by Reddit contributors.
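The collection step can be sketched roughly as follows. This is not the dataset author's script: the mapping from praw submission attributes to the column names is an assumption, and `submission_to_row` and `fetch_new_rows` are hypothetical helper names. The `reddit` argument is expected to be an authenticated `praw.Reddit` instance, whose credentials are not shown here.

```python
from types import SimpleNamespace


def submission_to_row(submission):
    """Map one praw Submission-like object to the dataset's column names.

    The attribute-to-column mapping below is an assumption, not taken
    from the original collection script.
    """
    return {
        "title": submission.title,
        "score": submission.score,
        "id": submission.id,
        "url": submission.url,
        "comms_num": submission.num_comments,
        "created": submission.created_utc,  # Unix seconds
        "body": submission.selftext,
    }


def fetch_new_rows(reddit, subreddit_name="UkrainianConflict", limit=100):
    """Fetch the newest posts; `reddit` is an authenticated praw.Reddit."""
    newest = reddit.subreddit(subreddit_name).new(limit=limit)
    return [submission_to_row(s) for s in newest]


# Offline stand-in for a submission (made-up values), so the mapping can
# be tried without network access or API credentials.
fake = SimpleNamespace(
    title="Example post",
    score=12,
    id="abc123",
    url="https://liveuamap.com/",
    num_comments=3,
    created_utc=1650000000.0,
    selftext="",
)
row = submission_to_row(fake)
```

Running `fetch_new_rows` once a day and appending the result to the existing file would approximate the daily merge described above.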
Columns
- title: The title of the Reddit post.
- Comment: The text of the comment or post; 79% of entries are valid. An example entry is "Russians plunder $5M farm vehicles from Ukraine -- to find they've been remotely disabled".
- score: The score (upvotes minus downvotes) of the post or comment, ranging from -143 to 25,200. The mean score is 52.3 with a standard deviation of 304.
- id: A unique identifier for each post or comment. There are 253,494 unique IDs in the dataset.
- url: The URL associated with the Reddit post, with 21% of entries being valid. A common URL observed is https://liveuamap.com/.
- comms_num: The number of comments associated with a given post. Values range from 0 to 3,620, with a mean of 5.83.
- created: The Unix timestamp (seconds since 1 January 1970 UTC) indicating when the post or comment was created, ranging from about 1.65 billion to 1.67 billion.
- body: The main body text of the post or comment. About 79% of entries are valid.
- timestamp: The human-readable date and time indicating when the post or comment was created, spanning from 21st March 2022 to 7th October 2022.
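The columns listed above can be read with the Python standard library alone. The following is a minimal sketch, assuming the CSV header uses exactly these column names. The `read_rows` helper and the two sample rows are made up for illustration and do not come from the dataset.

```python
import csv
import io
from datetime import datetime, timezone


def read_rows(handle):
    """Yield one dict per CSV record with typed numeric and date fields.

    Assumes score, comms_num, and created are always populated; rows
    with gaps in those columns would need extra handling.
    """
    for row in csv.DictReader(handle):
        created = float(row["created"])  # Unix seconds
        yield {
            "id": row["id"],
            "title": row["title"],
            "body": row["body"] or None,  # empty string becomes None
            "url": row["url"] or None,
            "score": int(float(row["score"])),
            "comms_num": int(float(row["comms_num"])),
            "created": created,
            "created_utc": datetime.fromtimestamp(created, tz=timezone.utc),
        }


# Tiny in-memory sample (made-up values) standing in for the real file.
SAMPLE = """title,score,id,url,comms_num,created,body
Example post,12,abc123,https://liveuamap.com/,3,1650000000,
Example thread,4,def456,,0,1660000000.0,Example comment text
"""

rows = list(read_rows(io.StringIO(SAMPLE)))
```

For the real file, the same function could be given a handle opened with `open("russian_invasion_of_ukraine.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption.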
Distribution
The dataset is provided as a CSV file, russian_invasion_of_ukraine.csv, with a size of 66.9 MB. It contains approximately 253,000 records, representing a substantial collection of Reddit posts and comments.
Usage
This dataset is ideal for understanding the daily unfolding events of the Russian invasion of Ukraine. It can be utilised to perform sentiment analysis on posts and comments, allowing for insights into public opinion. Furthermore, it supports topic modelling to extract prevailing themes and subjects discussed within the subreddit.
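As a first step toward the theme extraction mentioned above, term frequencies can be counted over post titles or comment bodies. This is a crude stand-in for topic modelling, not a topic model. The stopword list is a small illustrative sample, and `top_terms` is a hypothetical helper name.

```python
import re
from collections import Counter

# Small illustrative stopword list; a real analysis would use a fuller one.
STOPWORDS = {
    "the", "and", "for", "that", "this", "with", "are", "was",
    "have", "has", "from", "they", "their", "not", "but", "you",
}


def top_terms(texts, n=5):
    """Return the n most frequent non-stopword terms across all texts."""
    counts = Counter()
    for text in texts:
        for token in re.findall(r"[a-z']+", text.lower()):
            if len(token) > 2 and token not in STOPWORDS:
                counts[token] += 1
    return counts.most_common(n)


# Made-up example texts, not taken from the dataset.
texts = [
    "Russian forces shell the city",
    "The city holds; forces withdraw",
]
terms = top_terms(texts, n=2)
```

Grouping the texts by day or week before counting would show how the dominant terms shift over the period covered.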
Coverage
The dataset covers the Russian invasion of Ukraine, drawing its content from the r/UkrainianConflict subreddit. It spans 21st March 2022 to 7th October 2022 and reflects the discourse of contributors to that subreddit.
License
CC0: Public Domain
Who Can Use It
This dataset is suitable for researchers studying geopolitical conflicts, data scientists interested in social media analytics and natural language processing, journalists tracking public discourse on the war, and political analysts seeking to understand shifts in sentiment and key discussion points related to the Ukrainian conflict.
Dataset Name Suggestions
- Ukrainian Conflict Reddit Discourse
- Russia-Ukraine War Social Media Posts
- r/UkrainianConflict Daily Data
- Ukraine Invasion Reddit Discussion Analytics
Attributes
Original Data Source: Russia-Ukraine War Social Media Posts