Opendatabay APP

Worldwide Tweets on Ukraine Crisis

Government & Civic Records

Tags and Keywords

Online

Communities

Tabular

Text

Nlp

Russia

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Worldwide Tweets on Ukraine Crisis Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides a collection of tweets from around the world focusing on the Russia-Ukraine conflict. It offers valuable social media data for researchers and analysts interested in public discourse, sentiment, and trends related to this geopolitical event. The data, primarily in tabular format, is suitable for various analytical approaches, including natural language processing.

Columns

  • id: Unique identifier for each message.
  • conversation_id: Identifier to group messages belonging to the same conversation thread.
  • created_at: Date and time when the tweet was created.
  • date: Date when the tweet was created.
  • time: Time when the tweet was created.
  • timezone: Time zone of the user who posted the tweet.
  • user_id: Unique identifier for the Twitter user.
  • username: The username of the Twitter account.
  • name: The display name of the Twitter account.
  • place: The location associated with the tweet, if provided.
  • tweet: The full text content of the tweet.
  • language: The detected language of the tweet.
  • mentions: List of user mentions within the tweet.
  • urls: List of URLs included in the tweet.
  • photos: List of photo URLs attached to the tweet.
  • replies_count: The number of replies to the tweet.
  • retweets_count: The number of retweets the tweet received.
  • likes_count: The number of likes the tweet received.
  • hashtags: List of hashtags used in the tweet.
  • cashtags: List of cashtags used in the tweet.
  • link: The direct URL to the tweet.
  • retweet: Boolean indicating if the tweet is a retweet.
  • quote_url: URL of the quoted tweet, if applicable.
  • video: Boolean indicating if the tweet contains a video.
  • thumbnail: URL of the video thumbnail, if a video is present.
  • near: Approximate geographic location (textual description).
  • geo: Geographic coordinates of the tweet.
  • source: The application or client used to post the tweet.
  • user_rt_id: User ID of the original tweeter if this is a retweet.
  • user_rt: Username of the original tweeter if this is a retweet.
  • retweet_id: ID of the original tweet if this is a retweet.
  • reply_to: List of IDs of tweets this tweet is a reply to.
  • retweet_date: Date of the retweet.
  • translate: Boolean indicating if the tweet has been translated.
  • trans_src: Source language of the translation.
  • trans_dest: Destination language of the translation.

Distribution

The dataset is typically provided in a CSV file format. It contains tens of thousands of tweets, with various ranges showing record counts from over 1,500 to over 5,000 tweets per segment. The total number of unique tweet IDs is approximately 41,926, and the overall dataset size is around 43,000 to 44,000 records.

Usage

This dataset is ideal for:
  • Sentiment Analysis: Understanding public sentiment towards the Russia-Ukraine conflict.
  • Trend Analysis: Identifying emerging topics, narratives, and trends in social media discussions.
  • Social Media Monitoring: Tracking discourse and reactions to significant geopolitical events.
  • Linguistic Research: Analysing language patterns and communication strategies related to conflict.
  • Public Opinion Research: Gaining insights into global public perspectives on the war.

Coverage

The dataset offers global coverage, collecting tweets from around the world. The time range for the tweets primarily spans from early July 2022, specifically from 3rd July 2022 to 12th July 2022, with some records extending to 13th July 2022. No specific demographic breakdown is provided, but the global coverage suggests a diverse set of contributors.

License

CC0

Who Can Use It

  • Researchers: For academic studies on geopolitics, social media analysis, and public sentiment.
  • Data Scientists/Analysts: To build and test natural language processing models, perform data mining, and extract insights.
  • Journalists: To inform reporting and understand public reaction to current events.
  • Government and Policy Analysts: To gauge public opinion and assess the impact of events.
  • Non-Governmental Organisations (NGOs): For understanding social narratives surrounding humanitarian and political crises.

Dataset Name Suggestions

  • Global Russia-Ukraine Conflict Tweets
  • Social Media Discourse: Russia-Ukraine War
  • Ukraine Conflict Tweets (July 2022)
  • Geopolitical Tweet Analysis Dataset
  • Worldwide Tweets on Ukraine Crisis

Attributes

Original Data Source: Tweets on Russia Ukraine Conflict

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

22/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free