Opendatabay APP

Polish Google Maps Fake Reviews

Data Science and Analytics

Tags and Keywords

Classification

NLP

Artificial

Binary

Polish

Free

About

This dataset contains anonymised account and review data scraped from Google Maps. It is intended to support research and development in fake review detection on Google Maps. Each record is labelled as either fake or real, which makes the data suitable for binary classification tasks. It forms part of academic research on detecting deceptive online behaviour.

Columns

  • _id: A unique identifier for each record.
  • is_deleted: A boolean value indicating if the account or review has been deleted.
  • is_private: A boolean value indicating if the account is private.
  • is_real: A boolean value, the core label, indicating if the account/review is considered real or fake.
  • local_guide_level: An integer representing the level of a Google Maps Local Guide associated with the account.
  • name_score: An integer score indicating the commonness of the account's name within the Polish language context. Further details are available in the related academic article.
  • number_of_reviews: An integer representing the total number of reviews posted by the account.
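The column list above can be turned into a small typed loader. The following is a minimal sketch, not an official loader for this dataset: the sample rows are invented for illustration, the True/False text encoding of the boolean columns is an assumption, and the names Account, parse_bool, and load_accounts are hypothetical.

```python
import csv
import io
from dataclasses import dataclass

# Hypothetical sample: the values are invented for illustration, and the
# True/False text encoding of the boolean columns is an assumption.
SAMPLE_CSV = """\
_id,is_deleted,is_private,is_real,local_guide_level,name_score,number_of_reviews
a1,False,False,True,4,2,37
b2,False,True,False,0,0,3
"""


@dataclass
class Account:
    record_id: str
    is_deleted: bool
    is_private: bool
    is_real: bool
    local_guide_level: int
    name_score: int
    number_of_reviews: int


def parse_bool(text: str) -> bool:
    """Convert a textual boolean to bool, rejecting anything unexpected."""
    value = text.strip().lower()
    if value in {"true", "1"}:
        return True
    if value in {"false", "0"}:
        return False
    raise ValueError(f"unexpected boolean value: {text!r}")


def load_accounts(handle) -> list:
    """Read CSV rows into typed Account records, one per data row."""
    return [
        Account(
            record_id=row["_id"],
            is_deleted=parse_bool(row["is_deleted"]),
            is_private=parse_bool(row["is_private"]),
            is_real=parse_bool(row["is_real"]),
            local_guide_level=int(row["local_guide_level"]),
            name_score=int(row["name_score"]),
            number_of_reviews=int(row["number_of_reviews"]),
        )
        for row in csv.DictReader(handle)
    ]


accounts = load_accounts(io.StringIO(SAMPLE_CSV))
print(accounts[0])
```

To read a downloaded file instead of the in-memory sample, pass a handle opened with open(path, newline="", encoding="utf-8"), where path is wherever the CSV was saved. If the file encodes booleans differently, parse_bool is the only function that needs to change.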

Distribution

The dataset is typically provided in CSV format and comprises 605 records, each representing anonymised account and review data. The file size is not specified. The tabular structure is intended to integrate easily into data analysis and machine learning workflows.

Usage

This dataset is ideally suited for data science and analytics professionals, especially those focused on:
  • Developing and testing machine learning models for binary classification of fake versus real reviews.
  • Natural Language Processing (NLP) research related to review authenticity.
  • Investigating patterns of deceptive accounts and reviews on online platforms.
  • Applications in fraud detection and digital trust initiatives.
  • Academic research in areas such as applied sciences and artificial intelligence.
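As an illustration of the binary classification use case, the sketch below fits a one-feature threshold rule (a decision stump) on number_of_reviews. It is a toy baseline under stated assumptions, not the method used in the associated research: the values and labels are invented, the function name fit_stump is hypothetical, and the direction of the rule (more reviews means real) is an assumption chosen only to keep the example short.

```python
def fit_stump(values, labels):
    """Find the threshold t that maximises training accuracy for the
    rule `value >= t -> real`. Returns (threshold, accuracy)."""
    best_threshold, best_accuracy = None, -1.0
    for threshold in sorted(set(values)):
        predictions = [value >= threshold for value in values]
        correct = sum(p == y for p, y in zip(predictions, labels))
        accuracy = correct / len(labels)
        # Strict comparison keeps the lowest threshold among ties.
        if accuracy > best_accuracy:
            best_threshold, best_accuracy = threshold, accuracy
    return best_threshold, best_accuracy


# Hypothetical toy data: number_of_reviews per account and its is_real label.
number_of_reviews = [1, 2, 3, 40, 55, 12, 2, 80]
is_real = [False, False, False, True, True, True, False, True]

threshold, accuracy = fit_stump(number_of_reviews, is_real)
print(f"predict real when number_of_reviews >= {threshold} "
      f"(training accuracy {accuracy:.2f})")
```

The accuracy reported here is measured on the same rows used to choose the threshold, so it is optimistic. With only 605 records in the real dataset, a held-out split or cross-validation would be needed before drawing conclusions from any model.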

Coverage

The data collection period predates May 2023, the publication date of the associated research. Although the listing classifies the region coverage as global, the name_score column reflects the commonness of names in the Polish language, which suggests that the data originates from or focuses on Polish-speaking contexts. The dataset covers anonymised attributes of Google Maps accounts and their review behaviour.

License

CC BY 4.0

Who Can Use It

This dataset is primarily intended for:
  • Researchers and academics studying online fraud, social media analysis, and machine learning applications in authenticity detection.
  • Data scientists and AI/ML engineers looking to build or refine models for identifying fake reviews.
  • Organisations interested in protecting brand reputation or improving trust on their platforms.
  • Students undertaking projects in data analysis, classification, or NLP related to online reviews.

Dataset Name Suggestions

  • GMR-PL Fake Reviews Dataset
  • Google Maps Account Authenticity Data
  • Google Maps Review Classification Data
  • Polish Google Maps Fake Reviews
  • Online Review Authenticity Dataset

Attributes

Original Data Source: GMR-PL Fake reviews dataset

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

17/06/2025

REGION

GLOBAL

UNIVERSAL DATA QUALITY SCORE (UDQS)

5 / 5

VERSION

1.0
