Global COVID-19 Vaccine Sentiment and Tweet Archive
Social Media and Posts
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Analysing global sentiment during the peak of the COVID-19 immunisation drive provides critical insights into public health communication and vaccine hesitancy. This collection captures real-time reactions and discussions surrounding major vaccines such as Pfizer/BioNTech, Sinopharm, Sinovac, Moderna, Oxford/AstraZeneca, Covaxin, and Sputnik V. By examining a vast array of social media interactions, researchers can identify emerging narratives and the specific concerns or endorsements shared by a global audience during the roll-out of these medical interventions.
Columns
- user_name: The Twitter handle of the individual or organisation posting the content.
- user_location: The self-reported geographic origin of the user, which is unvalidated and can include non-geographic descriptions.
- user_description: A public biography written by the account holder to describe their interests or identity.
- user_created: The specific timestamp indicating when the user's Twitter account was first established.
- user_followers: The total count of accounts currently following this user.
- user_friends: The number of accounts this user is following.
- user_favourites: The total number of posts the user has liked on the platform since account creation.
- user_verified: A boolean indicator showing whether the account has been authenticated as a well-known figure or entity.
- id: The unique numerical identifier for the specific post assigned by the Twitter API.
- date: The calendar date and time the content was published, recorded as a DateTime object.
- text: The core content of the post, representing the most vital information for sentiment and thematic analysis.
- hashtags: A list of the specific hashtags included in the post to categorise the content for search.
- source: The device or application interface used to publish the tweet.
- retweets: The number of times the post was shared by others at the time of data collection.
- favorites: The total number of likes received by the tweet at the time of extraction.
- is_retweet: A boolean flag indicating if the post is original or a shared update from another user.
Distribution
The information is delivered in a single CSV file titled
vaccination_tweets.csv with a file size of approximately 2.39 MB. It contains 5,760 valid records, showing 100% integrity for core fields like the tweet ID and user names. While demographic data like location and description have some missing entries, the textual data is fully preserved. The dataset is managed with an expected monthly update frequency to capture the evolving nature of the discourse.Usage
This resource is ideal for conducting sentiment analysis to track public trust and emotional responses to specific vaccine brands over time. It is well-suited for building natural language processing models to classify health-related misinformation versus official reporting. Additionally, educational researchers can use the follower and retweet metrics to perform social network analysis and identify the most influential voices in the global vaccination conversation.
Coverage
The geographic scope is global, reflecting any user worldwide who utilised relevant search terms for the major vaccines, though the self-reported nature of location data provides a varied level of detail. Temporally, the records span from December 2020 to February 2021, documenting the critical early stages of the global vaccination campaigns. The demographic coverage consists of active social media users engaging with public health topics in English.
License
CC0: Public Domain
Who Can Use It
Public health analysts can leverage these records to monitor vaccine sentiment and refine communication strategies to address public concerns. Data scientists may utilise the text-rich descriptions and tweet content to practice clustering and topic modelling techniques. Furthermore, sociologists can explore the data to study the spread of digital health narratives and the impact of verified accounts on public perception.
Dataset Name Suggestions
- Global COVID-19 Vaccine Sentiment and Tweet Archive
- International Vaccination Discourse: Twitter Response Metrics
- Public Health Narratives: Pfizer, Moderna, and AstraZeneca Sentiment Data
- COVID-19 Immunisation Campaign: Social Media Engagement Records
- Real-time Vaccine Sentiment Tracking via Twitter API
Attributes
Original Data Source: Global COVID-19 Vaccine Sentiment and Tweet Archive
Loading...
Free
Download Dataset in CSV Format
Recommended Datasets
Loading recommendations...
