Amazon Audible Complete 2020 Data
Retail & Consumer Behavior
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset presents a detailed catalogue of audiobooks offered through Amazon's Audible service from 2020. It provides a complete listing of 6,368 audiobooks, capturing essential details such as book titles, authors, customer ratings, the number of reviews, and pricing information. Furthermore, it includes descriptions, listening times, and genre tags for each audiobook. This collection offers a valuable snapshot of the digital audiobook market, suitable for analysis of consumer trends and content performance within the sector.
Columns
- Book Name: The title of the audiobook. There are 5,396 unique book names recorded in the dataset.
- Author: The name of the author of the audiobook. The dataset contains 3,538 unique authors, with "Harvard Business Review" being the most frequently occurring author.
- Rating: The average rating of the audiobook on a scale of 0 to 5 stars. The average rating across the dataset is 3.91, with a significant number of titles rated between 4.4 and 5.0.
- Number of Reviews: The count of customer reviews submitted for each audiobook. The average number of reviews is 903, though 10% of records are missing this information.
- Price: The price of the audiobook when purchased without Audible credits. The average price is 923, with a small number of records missing price data.
- Description: A synopsis or summary of the audiobook's content.
- Listening Time: The total duration of the audiobook, provided in hours and minutes.
- Ranks and Genre Tags: A string containing various tags, including ranks and genres associated with the audiobook.
Distribution
The dataset is structured as a CSV file, with a size of 504.5 KB. It contains 6,368 records, representing the total number of audiobooks listed. While most features, such as Book Name, Author, Rating, and Price, are fully populated with 100% valid entries, the 'Number of Reviews' column has 631 (10%) missing values, and the 'Price' column has 3 missing entries. The 'Rating' column shows a mean of 3.91, with a standard deviation of 1.66, indicating a generally high rating distribution. The 'Number of Reviews' has a mean of 903, and the 'Price' has a mean of 923.
Usage
This dataset is ideal for a variety of applications, including:
- Market Research: Analysing trends in audiobook popularity, pricing strategies, and content types.
- Recommender Systems Development: Building and testing algorithms for suggesting audiobooks to users.
- Content Strategy: Identifying successful genres, authors, and audiobook characteristics.
- Data Analysis: Exploring relationships between ratings, reviews, prices, and listening times.
- E-commerce Insights: Understanding consumer engagement and purchasing behaviour on platforms like Audible.
Coverage
The dataset focuses on Amazon's Audible catalogue from 2020. The primary source of the data is Amazon.in, suggesting a focus on content available or popular within the Indian market. There is no specific demographic scope detailed beyond general customer reviews.
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Analysts: For developing predictive models, performing statistical analysis, and extracting market insights.
- E-commerce Professionals: To inform business decisions related to digital content offerings and pricing.
- Researchers: Studying trends in digital publishing and consumer media consumption.
- Students: For academic projects focusing on data analysis, machine learning, or digital humanities.
- Developers: To prototype recommendation engines or content discovery tools.
Dataset Name Suggestions
- Audible 2020 Audiobook Catalogue
- Amazon Audible Complete 2020 Data
- Audible.in Audiobooks 2020
- Digital Audiobook Catalogue 2020
- Audible Content Statistics 2020
Attributes
Original Data Source:Amazon Audible Complete 2020 Data