Pro- and Anti-Brexit Twitter Discourse 2022
Social Media and Posts
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
A large collection of tweets and corresponding user biodata focuses on Brexit sentiment gathered between January and March 2022. The data was collected as part of a Master's dissertation project, classifying users into 'pro-Brexit' and 'anti-Brexit' accounts based on publicly declared political positions found in their Twitter biographies. The resource captures the polarity of the discourse, featuring over 19,000 unique tweets and associated metadata, making it valuable for the analysis of political polarity, engagement patterns, and language use related to European topics.
Columns
The dataset contains 38 columns detailing both tweet content and user characteristics:
- Date, Alternate Date Format, Time: The exact time and date the tweet was posted.
- URL, User Profile Url: Direct links to the tweet and the user's Twitter profile.
- Hit Sentence: The text content of the tweet itself.
- Influencer, Twitter Screen Name: The Twitter handle and displayed name of the user who shared the tweet.
- Twitter Bio: The text from the profile bio, which was instrumental in determining the user's Brexit leaning (pro or anti).
- Country, Subregion, State, City: Geographic location information declared by the user.
- Language: Automated language detection, found to be English in 99% of records.
- Reach, Twitter Followers, Twitter Following: Metrics reflecting the potential audience of the tweet and the user’s network size.
- Engagement: The total sum of public interactions for the tweet (likes, retweets, and quote tweets).
- Sentiment: The result of automated sentiment detection, showing Neutral as the most frequent label, followed by Negative.
- Keywords: Key terms extracted from the text, with "Brexit" and "EU" being the most common.
- Twitter Authority: A rank assigned to the account based on influence.
- Twitter Client: The device used to post the tweet, such as Twitter for Android or Twitter for iPhone.
Distribution
The collection consists of approximately 211,000 records of Brexit Polarity Tweets, with an overall usability rating of 10.00. The data file is titled TweetDataset_AntiBrexit_Jan-Mar2022.csv and is 165.5 MB in size. The data spans a fixed period and is not expected to receive future updates.
Usage
This resource is suited for:
- Investigating political polarisation and the dynamics of online political conversations.
- Performing Natural Language Processing (NLP) tasks, including sentiment analysis focused on specific political events.
- Conducting social network analysis to map influence and reach within the defined pro- and anti-Brexit cohorts.
- Researching geographical trends in Twitter discussions about Europe.
Coverage
The temporal coverage extends from 1 January 2022 to 31 March 2022. The data is derived from accounts with publicly defined political positions on Brexit. Geographically, the data is heavily concentrated in the United Kingdom, which accounts for 48% of records with a declared location, followed by a large percentage of unknown locations. The vast majority of the text is in the English language.
License
CC0: Public Domain
Who Can Use It
- Academic Researchers: To study modern digital political movements, political communication, and public opinion shifts.
- Data Scientists: For training predictive models related to social media behaviour and content classification.
- Political Analysts: To identify trending topics, key influencers, and real-time shifts in political discourse among polarized groups.
Dataset Name Suggestions
- Brexit Polarity Tweets (Jan-Mar 2022)
- Pro- and Anti-Brexit Twitter Discourse 2022
- UK Political Sentiment on Twitter: Early 2022
Attributes
Original Data Source:Pro- and Anti-Brexit Twitter Discourse 2022
Loading...
