Corporate Twitter Mentions Time Series
Social Media and Posts
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
Time-series dataset capturing the volume of Twitter mentions for several large, publicly traded companies. The data tracks the number of mentions for each company's ticker symbol in five-minute intervals. It is part of the Numenta Anomaly Benchmark (NAB), an open-source project designed for evaluating real-time, streaming anomaly detection algorithms. This dataset is ideal for analysis, anomaly detection studies, and understanding social media trends related to major corporate brands.
Columns
- timestamp: The date and time when the mention volume was calculated.
- Apple: The number of Twitter mentions for Apple.
- Amazon: The number of Twitter mentions for Amazon.
- Salesforce: The number of Twitter mentions for Salesforce.
- CVS: The number of Twitter mentions for CVS.
- Facebook: The number of Twitter mentions for Facebook.
- Google: The number of Twitter mentions for Google.
- IBM: The number of Twitter mentions for IBM.
- Coca-Cola: The number of Twitter mentions for Coca-Cola.
- Pfizer: The number of Twitter mentions for Pfizer.
- UPS: The number of Twitter mentions for UPS.
Distribution
The dataset is provided in a single CSV file (
dataset.csv
) with a size of approximately 958.47 kB. It contains 11 columns, including a timestamp and mention volumes for ten different companies. The total number of records is approximately 15,900.Usage
This dataset is well-suited for a variety of applications, including:
- Time Series Analysis: Analysing trends and patterns in social media mentions over time.
- Anomaly Detection: Identifying unusual spikes or dips in mention volume for specific companies, which could correlate with real-world events.
- Financial Market Analysis: Exploring potential correlations between social media sentiment and stock market performance.
- Marketing and Brand Management: Monitoring brand visibility and public conversation on social media platforms.
Coverage
- Geographic: The data is sourced from Twitter, a global platform, so coverage is international. However, no specific geographic breakdown is provided.
- Time Range: The data spans from 26 February 2015 to 23 April 2015.
- Demographic: The dataset focuses on company mentions and does not contain demographic information about the Twitter users. It includes data for Apple, Amazon, Salesforce, CVS, Facebook, Google, IBM, Coca-Cola, Pfizer, and UPS.
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Analysts: For building and testing anomaly detection models or conducting time-series analysis.
- Financial Analysts and Quants: To investigate the relationship between social media chatter and financial indicators.
- Marketing Professionals: To track brand mentions and analyse the impact of marketing campaigns.
- Academic Researchers: For studies in social media analytics, computational social science, and business intelligence.
Dataset Name Suggestions
- Corporate Twitter Mentions Time Series
- NAB: Publicly Traded Company Twitter Volumes
- Real-time Anomaly Detection: Corporate Mentions
- Social Media Brand Mentions 2015
Attributes
Original Data Source: Corporate Twitter Mentions Time Series