Data Science Book Customer Ratings Dataset
Reviews & Ratings
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset offers a collection of customer reviews and ratings for data science-related books sourced from Amazon. It provides a valuable resource for understanding customer sentiment and the overall reception of various publications within the data science domain. The collection includes 20,647 individual reviews covering 836 distinct data science books. Each entry features the raw review text and a corresponding star rating, ranging from 1 to 5.
Columns
- stars: Represents the customer's rating for the book, indicated by a number from 1 to 5, where 5 is the highest possible score.
- comment: Contains the textual content of the customer's review.
- book_url: Provides the direct web address to the specific book's product page on Amazon.
Distribution
The dataset is structured as a CSV file and comprises 20,647 reviews. These reviews relate to 836 unique books identified through "Data Science" searches on Amazon. The star ratings are distributed across several bands:
- 1.00 - 1.20 stars: 1,430 reviews
- 2.00 - 2.20 stars: 929 reviews
- 3.00 - 3.20 stars: 1,431 reviews
- 4.00 - 4.20 stars: 2,894 reviews
- 4.80 - 5.00 stars: 13,963 reviews
The dataset contains unique values for stars from 1 to 5 and for 836 books.
Usage
This dataset is well-suited for a variety of analytical and developmental purposes, including:
- Natural Language Processing (NLP) tasks such as sentiment analysis, text classification, and topic modelling using the review comments.
- Machine Learning (ML) model training for tasks like predicting book popularity or building recommendation engines.
- Business intelligence applications to gain insights into consumer preferences and market trends for data science literature.
- Research into review patterns, user feedback mechanisms, and e-commerce dynamics.
Coverage
The dataset has a global regional coverage. It was listed on 16/06/2025. No specific demographic or historical time range information beyond the listing date is available in the provided sources.
License
CCO
Who Can Use It
This dataset is particularly useful for:
- Data Scientists and Machine Learning Engineers engaged in building and testing text-based analytical models.
- Academic Researchers and Students focused on computational linguistics, consumer behaviour, or data science education.
- Market Analysts and Business Strategists looking to understand product perception and competitive landscapes within the book industry.
- Developers creating applications that require user-generated content for training or analysis.
Dataset Name Suggestions
- Amazon Data Science Book Reviews
- Data Science Book Customer Ratings Dataset
- Amazon DS Book Reviews and Ratings
- Data Science Book Feedback Collection
Attributes
Original Data Source: Amazon Data Science Book Reviews