COVID-19 Vaccine Twitter Sentiment
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset captures public sentiment and behavioural changes in India related to the COVID-19 vaccination programme. It was extracted from Twitter during a period of considerable discussion and public skepticism surrounding vaccine validity and their export amidst domestic shortages. The data can be utilised to identify major events occurring within the country during this timeframe and to analyse shifts in public behaviour as a direct result. It may also provide insights into the demand for vaccination during this period.
Columns
The dataset includes the following columns, designed to be straightforward to interpret:
url
: The URL of the tweet.date
: The date and time the tweet was posted.content
: The original text content of the tweet.renderedContent
: The processed content of the tweet.id
: A unique identifier for the tweet.user
: Information about the user who posted the tweet.replyCount
: The number of replies to the tweet.retweetCount
: The number of retweets of the tweet.likeCount
: The number of likes the tweet received.quoteCount
: The number of times the tweet was quoted.conversationId
: The ID of the conversation thread the tweet belongs to.lang
: The language of the tweet.source
: The client used to post the tweet (e.g., "Twitter for Android").sourceUrl
: The URL associated with the tweet's source.sourceLabel
: A label for the tweet's source.outlinks
: External links included in the tweet.tcooutlinks
: Twitter shortened URLs in the tweet.media
: Information about any media attached to the tweet.retweetedTweet
: Details if the tweet is a retweet.quotedTweet
: Details if the tweet is a quoted tweet.inReplyToTweetId
: The ID of the tweet this one is in reply to.inReplyToUser
: Information about the user this tweet is in reply to.mentionedUsers
: Users mentioned in the tweet.coordinates
: Geographic coordinates associated with the tweet.place
: Geographic place information for the tweet.hashtags
: Hashtags used in the tweet.cashtags
: Cashtags used in the tweet.
Distribution
The data files are typically provided in CSV format. A sample file will be made available separately on the platform. Specific row or record counts for this dataset are not currently provided.
Usage
This dataset is ideally suited for:
- Social media sentiment analysis on public health topics.
- Tracking public opinion and behavioural changes during national events.
- Identifying key events and their impact on public discourse regarding vaccinations.
- Analysing the perceived demand for vaccination in India during the specified period.
- Applications in data science and analytics, especially for natural language processing (NLP) tasks such as text cleaning and messaging analysis.
Coverage
The dataset primarily covers social media discourse within India concerning COVID-19 vaccination. The tweet data included spans a period in early 2021, specifically from February to March. Geographical locations associated with the tweets include cities like Pune, Nellore, Perinthalmanna, Bengaluru, and Haveli.
License
CC0
Who Can Use It
This dataset is intended for data scientists, researchers, public health analysts, and organisations involved in tracking public sentiment and behavioural trends related to health initiatives. It is particularly useful for those studying social dynamics during a pandemic and the public reception of vaccination programmes in India.
Dataset Name Suggestions
- Indian COVID-19 Vaccine Twitter Sentiment
- India Vaccine Public Opinion Dataset
- COVID-19 Vaccination Discourse in India
- Indian Vaccine Sentiment Twitter Data
- India Public Health Twitter Analysis
Attributes
Original Data Source: Twitter sentiment analysis