Discussions Tier Achievement Data
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This collection details the user rankings, achievement statistics, and metadata for individuals participating in the Discussions Tier on the Kaggle platform. It serves as a valuable resource for analysts and data scientists interested in social network metrics, ranking system dynamics, and monitoring user influence and longevity in online data science environments. The usability score for this data is 10.00.
Columns
- CurrentRanking: The numerical discussion rank held by the user, ranging from 1 to 2343, with a calculated mean rank of 1.17 thousand.
- DisplayName: The user’s display name on the platform; 2337 unique values are present.
- UserName: The unique identifier for the user account. All 2343 records feature unique usernames.
- UserId: The numerical identifier for the user, ranging from 381 up to 9.29 million.
- Tier: The user’s discussion tier level. The minimum observed tier is 2, the maximum is 4, and the mean tier level is 2.08. The vast majority of users (2,210) fall within the Tier 2 grouping.
- Points: The total number of points accumulated by the user. The values range from 1 up to 8.84 thousand, with a mean of 140 points.
- RegisterDate: The date the user registered on the platform. The dates span from 30 January 2010 to 30 December 2021.
- TotalGold: The count of gold medals awarded to the user. The maximum count observed is 410.
- TotalSilver: The count of silver medals awarded to the user. The maximum count observed is 539.
- TotalBronze: The count of bronze medals awarded to the user. Values range from 19 up to 7,444.
Distribution
The data is provided in a tabular format as a CSV file, named
DiscussionRankings.csv, with a file size of 171.98 kB. It contains 10 distinct columns and 2343 valid records for most fields. Important to note is the data sparsity in medal counts: the TotalGold column has 910 missing records (39% missing), and TotalSilver is missing for 529 records (23% missing).Usage
Ideal applications include charting user progress and achievement over time, particularly for analyses similar to the original projects: "What Are You Talking About?" and "Charting User Progress - Discussions." The data is suitable for predictive modelling of user success or tier progression, analysis of medal achievement distributions, and studying platform engagement trends.
Coverage
The scope covers participants specifically within the Discussions Tier of the Kaggle ML and Data Science Community. The temporal range covered spans from the earliest registration date of 30 January 2010 through to the latest recorded date of 30 December 2021.
License
CC0: Public Domain.
Who Can Use It
This dataset is useful for data scientists, machine learning researchers, and community managers who are seeking to understand user ranking methodologies, activity metrics, and the distribution of medal achievements across various discussion tiers. Analysts focusing on computer science and online social structures will find this data pertinent.
Dataset Name Suggestions
- Kaggle Discussion User Rankings
- ML Community Discussion Metrics
- User Progression Data (Kaggle Discussions)
- Discussions Tier Achievement Data
Attributes
Original Data Source:Discussions Tier Achievement Data
Loading...
