Opendatabay APP

Brazilian Books Reader Engagement Dataset

Education & Learning Analytics

Tags and Keywords

  • Books
  • Literature
  • Brazil
  • Reading
  • Analytics

Free

About

This dataset offers an extensive look at the Brazilian literary scene, with information on over 11,000 books collected from Skoob, a popular Brazilian book cataloguing website. Inspired by the Goodreads-books dataset, it provides a wide range of features for each book, which makes it useful for analysing reading trends, book popularity, and reader engagement within the Brazilian market. The data includes authors, publishers, publication years, reader ratings, and engagement metrics such as the number of reviews and how many users have read or want to read a book.

Columns

  • titulo: The title of the book.
  • autor: The author of the book.
  • ISBN_13: The 13-digit International Standard Book Number.
  • ISBN_10: The 10-digit International Standard Book Number.
  • ano: The year the book was published.
  • paginas: The total number of pages in the book.
  • idioma: The language of the book's publication.
  • editora: The publisher of the book.
  • rating: The average reader rating on a scale of 0-5.
  • avaliacao: The total number of ratings received.
  • resenha: The total number of reviews written for the book.
  • abandonos: The number of users who stopped reading the book.
  • relendo: The number of users currently re-reading the book.
  • querem_ler: The number of users who want to read the book.
  • lendo: The number of users currently reading the book.
  • leram: The total number of users who have read the book.
  • descricao: A short description or synopsis of the book.
  • genero: The genre(s) of the book, separated by '/'.
  • male: The percentage of readers identified as male.
  • female: The percentage of readers identified as female.
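The column list above can be turned into a small schema check before any analysis. The following is a minimal sketch, assuming the CSV header uses exactly the names listed and that the engagement columns hold whole-number counts; `missing_columns` and `to_int` are hypothetical helper names, not part of the dataset.

```python
from typing import Optional

# Column names exactly as listed in the Columns section.
EXPECTED_COLUMNS = [
    "titulo", "autor", "ISBN_13", "ISBN_10", "ano", "paginas", "idioma",
    "editora", "rating", "avaliacao", "resenha", "abandonos", "relendo",
    "querem_ler", "lendo", "leram", "descricao", "genero", "male", "female",
]

# Assumption: these columns hold whole-number counts.
COUNT_COLUMNS = ["avaliacao", "resenha", "abandonos", "relendo",
                 "querem_ler", "lendo", "leram"]


def missing_columns(header: list[str]) -> list[str]:
    """Return the expected columns that are absent from a CSV header."""
    present = set(header)
    return [name for name in EXPECTED_COLUMNS if name not in present]


def to_int(value: Optional[str]) -> Optional[int]:
    """Parse a count cell; blank or malformed cells become None."""
    if value is None:
        return None
    try:
        return int(float(value.strip()))
    except (ValueError, OverflowError):
        return None
```

Returning None for unparseable cells, rather than 0, keeps missing counts distinguishable from books that genuinely have no readers.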

Distribution

The dataset is provided in a single CSV file named dados.csv, with a size of 13.08 MB. It contains information organised into 20 columns and approximately 12,000 records, each corresponding to a unique book.
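A quick way to confirm the stated shape is to count the header fields and data rows with a CSV parser. This is a sketch only; `csv_shape` is a hypothetical helper, and the sample text below is a made-up stand-in for dados.csv, not real data.

```python
import csv
import io
from typing import TextIO


def csv_shape(stream: TextIO) -> tuple[int, int]:
    """Return (data_rows, columns) for a CSV stream with a header row."""
    reader = csv.reader(stream)
    header = next(reader, [])
    rows = sum(1 for _ in reader)
    return rows, len(header)


# Tiny in-memory stand-in for dados.csv, used only to show the call.
sample = io.StringIO("titulo,autor,ano\nLivro A,Autor A,2001\n")
print(csv_shape(sample))  # prints (1, 3)
```

For the real file, the same function could be called on `open("dados.csv", newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption. A CSV parser is used rather than a plain line count because free-text fields such as descricao may contain line breaks inside quoted cells.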

Usage

This data is well suited to building recommendation systems tailored to Brazilian readers, conducting market analysis of the publishing industry, and performing academic research on literary trends. The book descriptions can also support text analysis such as sentiment or topic modelling; note that reviews are represented only as counts (resenha), not as review text. The data can further be used to create visualisations that map author popularity and genre preferences over time.
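One small building block for the recommendation use case is a popularity score that does not over-reward books with very few ratings. The sketch below assumes rating and avaliacao have already been parsed to numbers. The shrinkage-style weighted average is a common generic technique, not something prescribed by the dataset, and `prior_weight` is a hypothetical tuning value.

```python
def weighted_rating(rating: float, n_ratings: int,
                    global_mean: float, prior_weight: int = 50) -> float:
    """Shrink a book's mean rating toward the catalogue-wide mean.

    Books with few ratings stay close to global_mean; books with many
    ratings stay close to their own mean.
    """
    total = n_ratings + prior_weight
    if total <= 0:
        return global_mean
    return (n_ratings * rating + prior_weight * global_mean) / total


def rank_books(books: list[dict], prior_weight: int = 50) -> list[str]:
    """Return titles ordered by weighted rating, best first."""
    total_votes = sum(b["avaliacao"] for b in books)
    if total_votes == 0:
        return [b["titulo"] for b in books]
    # Catalogue-wide mean, weighted by how many ratings each book has.
    global_mean = sum(b["rating"] * b["avaliacao"] for b in books) / total_votes
    scored = sorted(
        books,
        key=lambda b: weighted_rating(b["rating"], b["avaliacao"],
                                      global_mean, prior_weight),
        reverse=True,
    )
    return [b["titulo"] for b in scored]
```

With this scoring, a book rated 5.0 by two users ranks below a book rated 4.6 by a thousand users, which is usually the desired behaviour for a "most loved" list.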

Coverage

The dataset focuses primarily on books available and catalogued within the Brazilian market, with the majority of titles published in Portuguese. The data covers a range of publication years up to 2021. It also provides demographic insights by detailing the gender split of the readership for each book.
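To inspect the temporal coverage, the ano column can be bucketed by decade. This is a minimal sketch, assuming ano arrives as text that may be blank or malformed; the 2021 upper bound comes from the coverage statement above, and the function names are hypothetical.

```python
from collections import Counter
from typing import Iterable, Optional


def parse_year(value: str, latest: int = 2021) -> Optional[int]:
    """Return the year as an int, or None if blank, malformed or out of range."""
    try:
        year = int(float(value))
    except (TypeError, ValueError, OverflowError):
        return None
    return year if 0 < year <= latest else None


def books_per_decade(years: Iterable[str]) -> Counter:
    """Count books per decade, skipping cells that do not parse as a year."""
    counts: Counter = Counter()
    for raw in years:
        year = parse_year(raw)
        if year is not None:
            counts[year // 10 * 10] += 1
    return counts
```

Values later than 2021 are treated as data-entry errors here; relax the `latest` argument if a newer version of the file extends the range.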

License

CC0: Public Domain

Who Can Use It

  • Data Scientists can build predictive models of book popularity or create book recommendation engines.
  • Publishing Professionals can analyse market trends, identify popular genres, and track author performance.
  • Academic Researchers can study reading habits, genre evolution, and the cultural impact of literature in Brazil.
  • Developers can create applications for book lovers, such as discovery platforms or reading trackers.
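As an illustration of the genre analysis mentioned above, the sketch below totals the leram count per genre. It assumes each row is a dict of strings (as produced by `csv.DictReader`), that genero is '/'-separated as documented, and that leram holds integer text; the helper names are hypothetical.

```python
from collections import defaultdict


def split_genres(value: str) -> list[str]:
    """Split a genero cell on '/' and drop empty or padded pieces."""
    return [part.strip() for part in value.split("/") if part.strip()]


def readers_by_genre(rows: list[dict]) -> dict[str, int]:
    """Sum the leram count over every genre a book is tagged with.

    A book with several genres contributes its full count to each one,
    so the totals overlap and should not be added together.
    """
    totals: dict[str, int] = defaultdict(int)
    for row in rows:
        for genre in split_genres(row.get("genero") or ""):
            # Assumes leram is an integer string; blank cells count as 0.
            totals[genre] += int(row.get("leram") or 0)
    return dict(totals)
```

Sorting the resulting dictionary by value gives a first ranking of genres by readership.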

Dataset Name Suggestions

  • Brazilian Books Reader Engagement Dataset
  • Skoob Brazilian Literature Metrics
  • Reader Analytics for Brazilian Books
  • Skoob Book Catalogue and User Metrics

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 17/09/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0