Opendatabay APP

Book Ratings and Reviews Dataset

Product Reviews & Feedback

Tags and Keywords

  • Books
  • Goodreads
  • Literature
  • Ratings
  • Reviews


Free

About

This dataset contains 100,000 book records collected from Goodreads. It began as a personal project to practise web scraping and is shared as a resource for the community. It includes the core columns needed to characterise a book, which makes it suitable for a range of analytical and development projects.

Columns

  • author: The name(s) of the book's author. There are 68,767 unique authors listed.
  • bookformat: The physical format of the book, such as Paperback, Hardcover, or other. Paperback accounts for 54% of entries, while Hardcover is 28%.
  • desc: The textual description of the book. About 7% of entries are missing this information.
  • genre: A list of genres associated with the book. Approximately 10% of entries are missing genre information.
  • img: A direct link to the book's cover image.
  • isbn: The International Standard Book Number (ISBN) for the book. Around 14% of entries do not have an ISBN.
  • isbn13: The 13-digit International Standard Book Number. Roughly 11% of entries are missing this code.
  • link: The direct link to the book's page on Goodreads. This column has 100,000 unique values.
  • pages: The total number of pages in the book. The mean number of pages is 255.
  • rating: The average user rating of the book on Goodreads. Ratings range from 0 to 5, with a mean of 3.83.
  • reviews: The total number of reviews the book has received. The average number of reviews is 182.
  • title: The official title of the book. There are 97,589 unique titles.
  • totalratings: The overall count of ratings given to the book. The mean total ratings is 2,990.
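
The per-column figures above (missing shares, means) can be re-checked after download. The following is a minimal sketch using only the Python standard library. It assumes missing values appear as empty cells, which the listing does not confirm, and it runs on a small invented sample rather than the real file.

```python
import csv
import io

# Tiny in-memory stand-in for the real file; these rows are invented for illustration.
SAMPLE_CSV = """author,title,pages,rating,isbn
Author A,Book One,320,4.10,1111111111
Author B,Book Two,,3.50,
Author C,Book Three,180,0,3333333333
"""


def load_rows(text):
    """Parse CSV text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def missing_share(rows, column):
    """Fraction of rows whose value for `column` is empty or absent.

    Assumption: a missing value is stored as an empty cell.
    """
    if not rows:
        return 0.0
    missing = sum(1 for row in rows if not (row.get(column) or "").strip())
    return missing / len(rows)


def column_mean(rows, column):
    """Mean of a numeric column, skipping blank cells; None if no values exist."""
    values = [float(row[column]) for row in rows if (row.get(column) or "").strip()]
    return sum(values) / len(values) if values else None


rows = load_rows(SAMPLE_CSV)
print(f"isbn missing: {missing_share(rows, 'isbn'):.0%}")
print(f"mean pages: {column_mean(rows, 'pages')}")
```

Run against the full file, `missing_share` for `isbn`, `isbn13`, `desc` and `genre` should land near the percentages listed above if the empty-cell assumption holds.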

Distribution

The dataset is provided as a CSV file containing 100,000 individual book records. The file size is 120.17 MB.
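
At roughly 120 MB, the file can be read one record at a time instead of being loaded whole. The sketch below is one possible approach with the standard library. The column names come from the Columns section, while the file name `goodreads_books.csv`, the UTF-8 encoding, and the helper names are assumptions made for illustration.

```python
import csv


def to_int(text):
    """Parse counts that may be written as '320' or '320.0'."""
    return int(float(text))


# Columns described as numeric in the listing, with the converter applied to each.
NUMERIC_COLUMNS = {
    "pages": to_int,
    "rating": float,
    "reviews": to_int,
    "totalratings": to_int,
}


def parse_book(row):
    """Convert numeric columns from text; blank or malformed cells become None."""
    book = dict(row)
    for column, cast in NUMERIC_COLUMNS.items():
        raw = (row.get(column) or "").strip()
        try:
            book[column] = cast(raw) if raw else None
        except ValueError:
            book[column] = None
    return book


def iter_books(path, encoding="utf-8"):
    """Yield one parsed record at a time so the whole file never sits in memory."""
    with open(path, newline="", encoding=encoding) as handle:
        for row in csv.DictReader(handle):
            yield parse_book(row)


# Hypothetical usage; the file name is an assumption, not taken from the listing:
# long_books = [b for b in iter_books("goodreads_books.csv")
#               if b["pages"] is not None and b["pages"] > 1000]
```

Opening the file with `newline=""` follows the `csv` module's recommendation and keeps line breaks inside quoted fields, which long book descriptions may contain.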

Usage

This dataset is well suited to developing book recommendation systems, conducting literary analysis, exploring publishing trends, and text mining of book descriptions. Note that the reviews column holds review counts, not review text. The dataset can also support academic research into reader behaviour and genre popularity.
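
As a starting point for a recommendation system, books can be ranked by a damped average that pulls a book's rating toward the dataset mean when it has few ratings. This is a common weighted-rating heuristic, not a method described by the dataset author. The sketch uses the mean rating of 3.83 quoted in the Columns section; the `min_ratings` threshold of 50 and the sample records are arbitrary assumptions.

```python
def weighted_rating(rating, total_ratings, global_mean=3.83, min_ratings=50):
    """Shrink a book's mean rating toward the global mean when it has few ratings.

    global_mean is the dataset-wide mean quoted in the listing; min_ratings is an
    assumed tuning knob that acts like that many "virtual" ratings at the mean.
    """
    v = max(total_ratings, 0)
    return (v * rating + min_ratings * global_mean) / (v + min_ratings)


def top_books(books, k=10, **kwargs):
    """Return up to k (score, title) pairs, best first, skipping incomplete records."""
    scored = [
        (weighted_rating(b["rating"], b["totalratings"], **kwargs), b["title"])
        for b in books
        if b.get("rating") is not None and b.get("totalratings") is not None
    ]
    return sorted(scored, reverse=True)[:k]


# Invented records for illustration only.
sample = [
    {"title": "Niche", "rating": 5.0, "totalratings": 2},
    {"title": "Popular", "rating": 4.4, "totalratings": 5000},
    {"title": "Unrated", "rating": None, "totalratings": 0},
]

for score, title in top_books(sample):
    print(f"{title}: {score:.3f}")
```

With these inputs the heavily rated book ranks above the book with a perfect score from only two ratings, which is the behaviour the damping is meant to produce. A larger `min_ratings` damps more strongly.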

Coverage

The dataset is drawn from Goodreads, a widely used global platform, and covers a broad range of books with no stated geographical or demographic limits. No time range is specified. Updates are expected annually.

License

CC0: Public Domain

Who Can Use It

This dataset is intended for a wide audience: data scientists, machine learning engineers, literary researchers, software developers building book-related applications, and anyone exploring book metadata for personal or public projects.

Dataset Name Suggestions

  • Goodreads Book Data 100K
  • Extensive Goodreads Book Collection
  • Book Ratings and Reviews Dataset
  • Literary Dataset from Goodreads
  • Goodreads Books Metadata

Attributes

Original Data Source: Book Ratings and Reviews Dataset

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 13/08/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
