Twitter User Profile Data
Social Media and Networking
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides detailed information on social media users, specifically those from Twitter. It was created using the Tweepy API and is a foundational resource for understanding user behaviour and network characteristics. The dataset is suitable for analysing user profiles and their publicly available activities, offering insights into various user attributes.
Columns
- Index: A unique identifier for each user.
- Name: The name of the user, which has been hidden due to privacy reasons.
- Followers_count: The total number of followers associated with the user's account.
- Verified: A boolean indicator specifying whether the user's account is verified.
- Protected: A boolean indicator showing if the user's account is private or public.
- Location: The self-reported geographical location of the user.
- Status_count: The total number of tweets posted by the user.
- Description: The biographical information or profile description provided by the user.
Distribution
The data files are typically provided in CSV format. This dataset contains approximately 2,065 individual user records. Sample files will be updated separately to the platform.
Usage
This dataset is ideal for:
- Analysing social media user behaviour, patterns, and trends.
- Developing and testing Natural Language Processing (NLP) models, particularly on user biographies and profile descriptions.
- Conducting research into user verification statuses and account privacy settings.
- Exploring the geographical distribution and self-reported locations of users.
- Building comprehensive user profiles for targeted analysis or application development.
Coverage
The dataset's geographic scope is global, encompassing users from various regions around the world. Notable geographic data includes specific locations reported by users, with India being an example of a significant represented region. The dataset represents a snapshot of user information; the exact time range of data capture is not specified. Demographic coverage is limited to publicly accessible user profile information.
License
CC0
Who Can Use It
This dataset is valuable for:
- Data Scientists who aim to build models related to user engagement and behaviour.
- Researchers focusing on online social networks and digital demographics.
- Developers requiring user profile information for integration into applications.
- Students learning about data analysis, social media data, and NLP techniques.
Dataset Name Suggestions
- Twitter User Profile Data
- Social Media User Analytics
- Tweepy User Statistics
- Global Twitter User Data
Attributes
Original Data Source: Twitter Dataset