Steam Global Bestsellers Dataset
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset offers a detailed look into best-selling games on the Steam platform, captured on 1 June 2025. It presents an extensive, global list of top-selling titles, collected without specific language filters to ensure a broad perspective. The dataset integrates information from three distinct sources – Steam, GameFAQs, and SteamDB – to provide a richer profile for each game. A key feature is the use of a standardised vocabulary of 42 unique genres and tags, ensuring consistency and a clean feature set for analysis, effectively representing each game with a minimal yet effective set of descriptors.
Columns
- game_name: The official title of the game.
- reviews_like_rate: The recommendation rate from user reviews on Steam, for example, '95% of the 100 reviews are positive'.
- all_reviews_number: The total count of user reviews a game has received on Steam.
- release_date: The official release date of the game on Steam, including initial release dates for Early Access titles.
- developer: The primary developer or studio responsible for the game.
- user_defined_tags: A set of genres and categories assigned by the Steam community, such as 'RPG' or 'Open World'. This dataset uses a curated vocabulary of 42 unique tags.
- supported_os: A list of operating systems officially supported by the game (e.g., win, mac, linux).
- supported_languages: A list of languages supported for the game's interface, audio, or subtitles.
- price: The game's price in MENA - U.S. Dollar, a regional currency. A value of 0 indicates a 'Free to Play' game.
- other_features: Features defined by Steam under 'player support', such as 'Single-player', 'Online PvP', or 'Steam Achievements'.
- age_restriction: The recommended age restriction for content, encoded as 0 (Everyone), 10 (10+), 13 (13+), or 17 (17+).
- rating: An overall user-provided rating for the game on a scale of 1 to 5, where 5 is the highest.
- difficulty: An estimated game difficulty as perceived by players, on a scale of 1 to 5, where 5 is the hardest.
- length: The average time in hours players spend to complete or fully experience the game, capped at 80 hours.
- estimated_downloads: The estimated total number of owners for the game.
Distribution
The dataset is in CSV format, consisting of 2,380 unique games and 15 columns. It is a composite dataset, merging information collected from the official Steam store, GameFAQs, and SteamDB. The data file is approximately 792.36 kB.
Usage
This dataset is ideal for analysing trends in the Steam gaming market, understanding factors contributing to a game's success, and exploring pricing strategies. It can be used for classification and regression tasks in machine learning, and for research into game development, player engagement, and content categorisation.
Coverage
The data was collected on 1 June 2025, offering a snapshot of top-sellers worldwide, with no specific language filters applied during collection. The release dates of games within the dataset range from 3 August 1994 to 30 May 2025. Data for age restriction, rating, difficulty, and length from GameFAQs was estimated from Steam user reviews when unavailable. Only games for which an estimated_downloads value was obtainable from SteamDB are included in the final dataset.
License
CC BY-SA 4.0
Who Can Use It
This dataset is suitable for game developers keen on market research, data scientists building predictive models, academic researchers studying the gaming industry, and gaming enthusiasts interested in market dynamics and player behaviour.
Dataset Name Suggestions
- Steam Global Bestsellers Dataset
- Enriched Steam Games Analytics
- Top Selling Steam Games Data
- Steam Market Insights Dataset
- Global Steam Game Metrics
Attributes
Original Data Source: Steam Global Bestsellers Dataset