Iran Social Movement Twitter Archive
Social Media and Posts
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset captures tweets related to the Iran protests sparked by the tragic death of Mahsa Amini. It consists of content collected using Tweepy, specifically tweets containing the hashtag #IranProtests2022. The collection process was initially incremental, updated daily, but automatic updates have unfortunately been discontinued due to changes in the Twitter API's free tier. This dataset offers insights into a significant and widely discussed topic on Twitter.
Columns
- user_name: The display name of the Twitter user.
- name: The user's unique account name.
- user_location: The self-declared location of the user, which may include specific cities like Washington, DC, or be null.
- user_description: The biographical text provided by the user in their profile, often containing relevant hashtags like #FreeIran and #MahsaAmini.
- user_created: The date and time when the Twitter account was originally created, spanning from October 2006 to May 2023.
- user_followers: The total count of followers an individual Twitter user has, ranging from 0 to over 53 million.
- user_friends: The number of accounts a user is following, with values up to 315,000.
- user_favourites: The total number of tweets a user has marked as liked, with values reaching nearly 1.5 million.
- user_verified: A boolean indicator showing whether the Twitter account is verified.
- date: The timestamp when the tweet was posted, covering a period from September 2022 to June 2023.
- text: The full content of the tweet message, offering a wide range of unique values.
- hashtags: A list of hashtags included within the tweet text, such as ['MahsaAmini'].
- source: The application or client used to post the tweet, with common examples including Twitter for Android and Twitter for iPhone.
Distribution
The dataset is provided as a CSV file named
tweets.csv
, with a size of 258.66 MB. It contains 12 distinct columns and comprises approximately 590,000 valid records.Usage
This dataset is ideal for social issues and advocacy research. It can be used for analysing public sentiment surrounding the Iran protests, tracking evolving discourse, identifying key hashtags and influential accounts, and understanding the spread of information related to social movements and human rights issues on Twitter.
Coverage
The dataset primarily covers tweets related to the Iran protests sparked by Mahsa Amini's death. The tweet timestamps range from 23rd September 2022 to 10th June 2023. User account creation dates extend much further back, from 6th October 2006 to 31st May 2023. Geographic coverage is implicitly global, reflecting the diverse locations of Twitter users discussing the protests, though specific location data is often absent.
License
CC0: Public Domain
Who Can Use It
This dataset is particularly useful for researchers studying social media dynamics, data scientists interested in text analysis and sentiment, journalists covering global current events and human rights, and advocacy groups monitoring and understanding social movements. It allows for detailed investigations into how significant political and social events are discussed and unfold online.
Dataset Name Suggestions
- Iran Protests 2022-2023 Twitter Data
- Mahsa Amini Protests Tweets
- Iran Social Movement Twitter Archive
- Tweets on Mahsa Amini Protests
- Iran Human Rights Discourse on Twitter
Attributes
Original Data Source: Iran Social Movement Twitter Archive