Kaggle Community Engagement Data
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides publicly available information concerning user performance and engagement within the Discussion section of Kaggle. It tracks and updates weekly the top 400 users, uniquely featuring a 'VotePerPost' ranking, which indicates the average number of votes each user's posts receive. The dataset aims to identify whose comments garner the most votes on the platform.
Columns
- date: The date when the ranking was recorded, in YYYY-MM-DD format.
- userID: A unique identifier for the user within Kaggle.
- currentRanking: The user's overall discussion ranking.
- votePerPostRanking: The user's ranking based on the average votes per post.
- tier: The user's recognised tier (e.g., EXPERT, MASTER).
- points: The total discussion points accrued by the user.
- totalGoldMedals: The overall count of gold medals awarded to the user.
- totalSilverMedals: The overall count of silver medals awarded to the user.
- totalBronzeMedals: The overall count of bronze medals awarded to the user.
- votesPerPost: Calculated as totalUpvotesCount divided by totalDiscussionsCount, representing votes per post.
- totalDiscussionsCount: The sum of totalTopicsCount and totalRepliesCount.
- totalTopicsCount: The number of discussion topics initiated by the user.
- totalRepliesCount: The total number of comments made by the current user.
- totalUpvotesCount: The aggregate number of upvotes received for the current user's comments and topics.
Distribution
The dataset is provided as a CSV file,
discussions.csv
, approximately 709.73 KB in size. It contains 14 columns and typically holds 8400 records. Data is updated weekly, focusing on the top 400 discussion users. The sample data spans from 30th November 2022 to 20th January 2023.Usage
This dataset is ideal for analysing user engagement and influence on Kaggle discussions. It can be used to identify high-performing users, track changes in user rankings and performance over time, and gain insights into the dynamics of online community interactions. Researchers and data analysts can leverage this data to study community behaviour, virality of content, and the impact of user contributions.
Coverage
The dataset focuses on the discussion section of Kaggle, a global platform. It covers the top 400 users from discussions and is updated weekly. The sample data includes dates from November 2022 to January 2023. User tiers such as 'EXPERT' and 'MASTER' are included, providing demographic scope within the Kaggle community.
License
CC0: Public Domain
Who Can Use It
Data scientists, researchers, and community managers interested in platform user behaviour, influence, and engagement metrics. Kaggle users looking to benchmark their performance or understand success factors in discussions. Anyone analysing online community dynamics.
Dataset Name Suggestions
- Kaggle Discussion User Rankings
- Kaggle Votes Per Post Analytics
- Kaggle Community Engagement Data
- Top Kaggle Discussion Users Performance
- Kaggle User Influence Metrics
Attributes
Original Data Source: Kaggle Community Engagement Data