Social Movement 2017 Twitter Archive
Social Media and Posts
Free
About
A collection of over 28,000 tweets posted on Twitter (now X) during the height of the Me Too movement, primarily spanning the period around 2017–2018. The data offers valuable insights into public discussion, emotional responses, and the spread of content related to this significant social movement, providing a foundation for studying social media activism and global discourse.
Columns
The dataset includes nine distinct fields detailing tweet attributes and content:
- tweetid: The unique identifier assigned to each individual tweet record.
- created: The specific date and time when the tweet was posted.
- text: The raw text content of the tweet itself.
- retweets: The total number of retweets received by the post.
- favorites: The total count of users who marked the tweet as a 'favourite' or liked it.
- source.r: The operating system or device source from which the tweet was posted (e.g., IPHONE, ANDROID).
- hashtag: Any mentions or hashtags present within the tweet.
- num.emojis: A count of the number of emojis used in the tweet text.
- emoji_names: The formal names of the emojis used in the tweet text.
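As a minimal sketch of how the nine fields listed above might be read, the following uses only the Python standard library. The helper names (`load_tweets`, `_to_int`) are hypothetical, the sample rows are invented purely for illustration, and the assumption that retweets, favorites and num.emojis hold plain numbers (possibly blank) has not been checked against the actual file.

```python
import csv
import io

# Column names as listed in this dataset description.
EXPECTED_COLUMNS = [
    "tweetid", "created", "text", "retweets", "favorites",
    "source.r", "hashtag", "num.emojis", "emoji_names",
]
# Assumption: these three columns are numeric counts.
NUMERIC_COLUMNS = ("retweets", "favorites", "num.emojis")


def _to_int(value):
    """Convert a CSV cell to int; return None for blank or non-numeric cells."""
    try:
        return int(float(value))
    except (TypeError, ValueError, OverflowError):
        return None


def load_tweets(fileobj):
    """Read tweet rows from an open CSV file object into a list of dicts."""
    reader = csv.DictReader(fileobj)
    missing = [c for c in EXPECTED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    rows = []
    for row in reader:
        for col in NUMERIC_COLUMNS:
            row[col] = _to_int(row[col])
        rows.append(row)
    return rows


# Invented sample rows, for illustration only (not taken from the dataset).
sample = io.StringIO(
    "tweetid,created,text,retweets,favorites,source.r,hashtag,num.emojis,emoji_names\n"
    "1,2017-10-16 12:00:00,hello,3,5,IPHONE,MeToo,0,\n"
    "2,2017-10-17 08:30:00,hi,,2,ANDROID,MeToo,1,red heart\n"
)
rows = load_tweets(sample)
```

With the real file, the same function could be called as `load_tweets(open("MeToo_Tweets.csv", newline="", encoding="utf-8"))`; the encoding is an assumption and may need adjusting.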
Distribution
The data is provided as a single CSV file, MeToo_Tweets.csv, with a size of 4.51 MB. It contains approximately 28,600 valid records. The archive is not expected to be updated, as it captures a specific historical window of activity.
Usage
This archive is ideally suited for tasks such as sentiment analysis on social issues, tracking the spread and virality of social movements, studying temporal patterns in hashtag use, and developing Natural Language Processing (NLP) models focused on political or cultural discourse.
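One of the uses named above, studying hashtag use, could start from a simple frequency count. The sketch below extracts hashtags from the text column with a regular expression rather than relying on the hashtag column, whose exact format is not described here. The function name `hashtag_counts` is hypothetical, and the example strings are invented for illustration.

```python
import re
from collections import Counter

# Matches a '#' followed by one or more word characters.
HASHTAG_RE = re.compile(r"#(\w+)")


def hashtag_counts(texts):
    """Count hashtags across tweet texts, ignoring case."""
    counts = Counter()
    for text in texts:
        counts.update(tag.lower() for tag in HASHTAG_RE.findall(text or ""))
    return counts


# Invented example texts (not taken from the dataset).
texts = [
    "#MeToo it happened to me too",
    "Solidarity #metoo #TimesUp",
    "no tags here",
]
counts = hashtag_counts(texts)
top = counts.most_common(1)
```

Grouping the same counts by posting date would give a first view of temporal patterns, provided the created field can be parsed reliably.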
Coverage
The temporal scope of the data centres on 2017 to 2018, documenting activity during the emergence and growth of the Me Too movement. Timestamps show the heaviest posting density around 16 and 17 October 2017. The data captures content generated by various Twitter users, with the source.r field indicating that IPHONE (43%) and ANDROID (20%) are the most frequently used platforms for posting.
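The daily posting density and device shares described above could be recomputed along the following lines. This is a sketch under stated assumptions: the timestamp format of the created field is not documented here, so the default `"%Y-%m-%d %H:%M:%S"` is a guess that may need changing, and the function names and sample rows are hypothetical.

```python
from collections import Counter
from datetime import datetime


def tweets_per_day(rows, fmt="%Y-%m-%d %H:%M:%S"):
    """Count tweets per calendar day; rows with unparseable dates are skipped.

    The default fmt is an assumption about how 'created' is stored.
    """
    counts = Counter()
    for row in rows:
        try:
            day = datetime.strptime(row["created"], fmt).date()
        except (KeyError, TypeError, ValueError):
            continue
        counts[day.isoformat()] += 1
    return counts


def source_shares(rows):
    """Return each posting source's share of all rows (values sum to 1)."""
    counts = Counter(
        (row.get("source.r") or "UNKNOWN").strip().upper() for row in rows
    )
    total = sum(counts.values())
    return {src: n / total for src, n in counts.items()} if total else {}


# Invented sample rows, for illustration only (not taken from the dataset).
rows = [
    {"created": "2017-10-16 09:15:00", "source.r": "IPHONE"},
    {"created": "2017-10-16 21:40:00", "source.r": "ANDROID"},
    {"created": "2017-10-17 07:05:00", "source.r": "IPHONE"},
    {"created": "not a date", "source.r": "IPHONE"},
]
per_day = tweets_per_day(rows)
shares = source_shares(rows)
```

Calling `per_day.most_common(2)` on the full dataset would show which days carry the most tweets, which can then be compared with the dates given above.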
License
CC0: Public Domain
Who Can Use It
- Social Scientists: To examine public responses and mobilisation strategies within the digital sphere.
- Data Scientists: For training models that detect trending topics, virality, or emotional tone in short-form social media text.
- Media and Cultural Scholars: To analyse the role of platforms like Twitter/X in framing global conversations about gender and power.
Dataset Name Suggestions
MeToo Tweets Dataset
Social Movement 2017 Twitter Archive
#MeToo Conversation Data
Historical Twitter Discourse 2017-18
Attributes
Original Data Source: Social Movement 2017 Twitter Archive