COVID-19 Vaccine Tweets: Bharat Biotech
Health Information Systems & Technology
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a collection of over 200 tweets posted by Bharat BioTech concerning their Covaxin COVID-19 vaccine. It is designed to offer insights into the public communication strategies of a key biotechnology company during a significant global health event. The dataset is available in two forms: an original non-cleaned CSV file with 36 columns and a refined, cleaned CSV file tailored for various Natural Language Processing (NLP) tasks. This resource is valuable for understanding public discourse and corporate communication related to vaccine development and deployment in India.
Columns
The refined (cleaned) CSV file, which is optimised for analytical tasks, includes the following key columns:
- Username: The Twitter account's username.
- Name: The display name associated with the Twitter account.
- Tweets: The full text content of the tweets published by BharatBioTech about Covaxin.
- Lang: The language in which each tweet was written.
Distribution
The dataset is provided in CSV format. It contains over 200 tweets, with approximately 230 unique entries when considering ID ranges. The tweet count varies across different periods: 2 tweets from June 2014 to February 2015, 58 from February to October 2019, 90 from October 2019 to May 2020, and 88 from May 2020 to January 2021. Additionally, there are numerous tweets recorded specifically on January 28, 2021, with counts such as 6, 44, 19, 15, 16, 38, 44, 18, 31, and 7 for different segments of that day.
Usage
This dataset is ideally suited for a wide array of analytical applications, particularly those involving Natural Language Processing. It can be utilised for:
- Sentiment analysis on public and corporate communication regarding vaccines.
- Tracking information dissemination during public health crises.
- Linguistic analysis of corporate messaging.
- Developing and testing NLP models for tweet analysis.
Coverage
The geographic scope of the tweets is centred around India, given Bharat BioTech's origin and focus. The temporal coverage spans multiple years, providing a historical perspective on the company's social media activity related to Covaxin. The tweets range from 27 June 2014 to 29 January 2021. Key periods include:
- 27 June 2014 - 22 February 2015
- 5 February 2019 - 3 October 2019
- 3 October 2019 - 31 May 2020
- 31 May 2020 - 27 January 2021
- Extensive activity on 28 January 2021
License
CCO
Who Can Use It
This dataset is highly relevant for:
- Data scientists and machine learning engineers working on NLP models and text classification.
- Researchers in public health, sociology, and communication studies interested in vaccine hesitancy, information spread, or corporate social responsibility.
- Public policy analysts tracking responses to health initiatives.
- Students undertaking projects involving social media data analysis or health informatics.
Dataset Name Suggestions
- BharatBioTech Covaxin Tweets
- COVID-19 Vaccine Tweets: Bharat Biotech
- Covaxin Twitter Data for NLP
- BharatBioTech Social Media Insights
Attributes
Original Data Source: COVID-19 BharatBioTech Tweets