Opendatabay APP

Polish Book Reviews and Metrics

Product Reviews & Feedback

Tags and Keywords

Polish

Books

Reviews

Ratings

Literature

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Polish Book Reviews and Metrics Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This collection of book data, primarily in Polish, captures user engagement and publication details for hundreds of thousands of titles. Sourced from a major Polish online forum dedicated to books, the data includes user scores, counts of readers, owners, and those who favour or wish to read the books. The resource is ideal for studying regional literary trends, understanding user sentiment, and analyzing the characteristics of books popular within the Polish reading community. Users should note that the dataset reflects raw forum data and may contain unexpected or null values in several fields.

Columns

The dataset contains 21 distinct columns:
  • Title: The title of the book, usually translated into Polish.
  • OriginalTitle: The book's original title, often missing for Polish original works or untranslated records.
  • Publisher: The name of the publishing entity.
  • Author: The name(s) of the book's author(s).
  • Translator: The name(s) of the translator(s). This field is frequently missing.
  • Score: The user-assigned rating, ranging from 0.0 to 10.0. The average score is approximately 6.89.
  • AmountOfScores: The total count of scores submitted for the book.
  • AmountOfComments: The total number of user comments associated with the title.
  • Part: Indicates which part of a book series the record represents.
  • Read: The number of users who have marked the book as read.
  • Own: The number of users who have marked the book as owned.
  • Favorite: The count of users who have assigned the book as a favourite.
  • WantToRead: The count of users who have indicated they wish to read the book.
  • Category: The assigned book category (e.g., komiksy, literatura piękna).
  • Pages: The number of pages in the book. The average length is around 248 pages.
  • Tags: User-defined book tags.
  • PublishDate: The date the specific edition was published.
  • FirstPublishDate: The date the book was initially published globally.
  • FirstPolishPublishDate: The date of the first Polish edition release.
  • Language: The language of the edition, predominantly Polish (81%) and English (16%).
  • ISBN: The International Standard Book Number.

Distribution

The dataset, provided as a single CSV file named book_data.csv, occupies 71.73 MB. It comprises over 300,000 records, each featuring 21 distinct attributes. While the entire structure is present, some key fields like OriginalTitle, Translator, Part, and user engagement metrics like Favorite have a significant percentage of missing values. The data collection is static, with no expected updates.

Usage

The data is excellently suited for:
  • Building advanced recommendation systems for Polish readers.
  • Performing detailed analysis of book popularity, distinguishing between user interest (WantToRead) and actual consumption (Read/Own).
  • Conducting sentiment analysis on user scores across various literary categories.
  • Market research focused on the Polish publishing industry, identifying key publishers and popular genres.

Coverage

The data covers books reviewed on a major Polish forum, focusing primarily on Polish language editions and titles translated into Polish. Publication dates range widely, with records spanning from as early as the 11th century up to 2023. The vast majority of the publication dates fall within the period of 1983 to 2023. The geographic and demographic scope is centred exclusively on the Polish online reading community.

License

CC0: Public Domain

Who Can Use It

Intended users include data scientists interested in developing multilingual content algorithms, academic researchers studying the sociology of literature or online review forums, and publishers seeking granular insights into the Polish book market.

Dataset Name Suggestions

  • Polish Book Reviews and Metrics
  • Lubimyczytac Book Data
  • Polish User-Scored Book Catalog
  • European Book Ratings Data

Attributes

Original Data Source: Polish Book Reviews and Metrics

Listing Stats

VIEWS

8

DOWNLOADS

1

LISTED

31/10/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format