Opendatabay APP

Top 50 Amazon Books Dataset

Product Reviews & Feedback

Tags and Keywords

Books

Amazon

Bestsellers

Literature

Reviews

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Top 50 Amazon Books Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset presents a detailed overview of the Amazon Top 50 Bestselling Books from 2009 up to March 2022. It was created for a Coursera Google Data Analytics Certificate capstone case study project, initially using data from 2009-2019 and then extended by scraping Amazon for data from 2020, 2021, and the beginning of 2022 (January 1 to March 26). It offers valuable insights into popular literature trends and consumer preferences on Amazon over more than a decade.
  • Columns

The dataset comprises 7 distinct columns, each providing specific details about the bestselling books:
  • Name: The title of the book. The most common title found is "Publication Manual of the American Psychological Association, 6th Edition", appearing in 1% of entries, with "The Very Hungry Caterpillar" also at 1%. There are 441 unique book titles.
  • Author: The author of the book. Jeff Kinney and Gary Chapman are the most frequent authors, each accounting for 2% of entries. There are 305 unique authors recorded.
  • User Rating: The average user rating for the book, measured out of 5 stars. Ratings range from 3.3 to 4.9, with a mean rating of 4.64. Most ratings fall between 4.58 and 4.90 stars.
  • Reviews: The number of reviews the book has received. The number of reviews varies widely, from a minimum of 37 to a maximum of 209,000, with a mean of 19,300 reviews.
  • Price: The price of the book, rounded. Prices range from £0 to £105, with a mean price of £12.70. A large proportion of books (339 entries) are priced between £0 and £10.50.
  • Year: The year the book appeared on the bestseller list. The data spans from 2009 to 2022, with a mean year of 2020.
  • Genre: The type of book, categorised as either Non-Fiction or Fiction. Non-Fiction books make up 55% of the dataset, while Fiction accounts for 45%.
  • Distribution

The dataset is provided in a CSV format (bestsellers_with_categories_2022_03_27.csv). It has a file size of 65.55 kB. The dataset contains 700 valid records across its 7 columns, with no mismatched or missing data points reported for any column.
  • Usage

This dataset is ideal for various analytical applications, including:
  • Market research into book publishing trends.
  • Analysis of consumer preferences and buying habits for books.
  • Identifying patterns in book popularity over time.
  • Exploring the relationship between book attributes (rating, reviews, price, genre) and bestseller status.
  • Academic research in literature, media, and data analytics.
  • Coverage

The dataset covers Amazon's Top 50 Bestselling Books specifically. The time range for the data is from 2009 to March 26, 2022. Data was originally sourced up to 2019 and subsequently updated for 2020, 2021, and the beginning of 2022.
  • License

CC0: Public Domain
  • Who Can Use It

This dataset is suitable for:
  • Data analysts and data scientists seeking real-world data for practice or projects.
  • Researchers studying literature, consumer behaviour, or e-commerce trends.
  • Publishers and authors interested in understanding bestseller characteristics and market dynamics.
  • Students undertaking projects related to data analytics and market analysis.
  • Dataset Name Suggestions

  • Amazon Bestselling Books 2009-2022
  • Top 50 Amazon Books Dataset
  • Global Bestsellers Analytics
  • Amazon Book Trends Data
  • Literary Sales Data 2009-2022
  • Attributes

Original Data Source: Top 50 Amazon Books Dataset

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

13/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format