March 2023 ChatGPT Tweet Sentiment Analysis
Social Media and Posts
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides insights into public sentiment surrounding ChatGPT on Twitter during March 2023. It consists of approximately 100,000 English tweets that mention "chatgpt", collected between 18th March and 21st March 2023. The data includes the tweet content, engagement metrics, and pre-processed text along with sentiment labels (positive, neutral, negative) and their corresponding scores. Usernames are masked, and tags or links within tweets have been removed to protect privacy and focus on content analysis. This resource is ideal for analysing social media discourse and sentiment trends related to AI and natural language processing models.
Columns
- ID: A unique identifier for each tweet.
- Date: The date the tweet was sent.
- Username: The username of the person who tweeted, which has been masked, with non-real IDs generated for privacy.
- Tweet: The actual content of the tweet, with tags and links removed.
- ReplyCount: The number of replies a tweet received.
- RetweetCount: The number of times a tweet was retweeted.
- LikeCount: The number of likes a tweet accumulated.
- QuotesCount: The number of quotes a tweet received.
- Processed_tweet: The pre-processed version of the tweet content, prepared for sentiment analysis.
- Sentiment_label: The assigned sentiment category for the tweet (e.g., neutral, negative, positive).
- Sentiment_score: The numerical score associated with the assigned sentiment label.
- OnlyDate: The date value extracted from the 'Date' column.
- OnlyHour: The hour value extracted from the 'Date' column.
- OnlyMin: The minute value extracted from the 'Date' column.
Distribution
The dataset is provided in a CSV format and has a file size of 34.3 MB. It contains approximately 100,000 English tweets, with 98.8k valid records across all variables. Data for labelling and scoring has been pre-processed to be readily usable.
Usage
This dataset can be used for various applications, including:
- Analysing public opinion and sentiment towards ChatGPT.
- Tracking social media trends related to artificial intelligence.
- Developing and testing sentiment analysis models.
- Researching the societal impact and perception of new AI technologies.
- Marketing analysis to understand public reception of AI products.
Coverage
- Geographic Scope: Tweets are in English, implying a global reach where English is used on Twitter.
- Time Range: The tweets were collected between 18th March 2023 and 21st March 2023. Date counts within the dataset also show entries from 17th March 2023 to 22nd March 2023.
- Demographic Scope: Not explicitly detailed, but it reflects the general Twitter user base engaging with ChatGPT-related content. Usernames are masked for privacy.
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Researchers: To conduct sentiment analysis studies, train machine learning models, and explore public discourse on AI.
- Marketing Professionals: To gauge public perception of ChatGPT, identify emerging trends, and inform communication strategies.
- AI Ethicists and Policy Makers: To understand societal responses and concerns regarding artificial intelligence advancements.
- Journalists and Media Analysts: To report on public sentiment and engagement with AI technologies.
Dataset Name Suggestions
- ChatGPT Twitter Sentiment Data (March 2023)
- March 2023 ChatGPT Tweet Sentiment Analysis
- Public Sentiment on ChatGPT: Twitter Data
- AI Discourse on Twitter: ChatGPT Edition (March 2023)
Attributes
Original Data Source: March 2023 ChatGPT Tweet Sentiment Analysis