Polish Google Maps Fake Reviews
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset contains anonymised data of accounts and reviews, meticulously collected through scraping Google Maps. Its primary purpose is to support research and development in the field of fake review detection, specifically for Google Maps. The data is carefully labelled as either fake or real, providing a clear basis for classification tasks. It forms a crucial part of academic research focused on detecting deceptive online behaviour.
Columns
_id
: A unique identifier for each record.is_deleted
: A boolean value indicating if the account or review has been deleted.is_private
: A boolean value indicating if the account is private.is_real
: A boolean value, the core label, indicating if the account/review is considered real or fake.local_guide_level
: An integer representing the level of a Google Maps Local Guide associated with the account.name_score
: An integer score indicating the commonness of the account's name within the Polish language context. Further details are available in the related academic article.number_of_reviews
: An integer representing the total number of reviews posted by the account.
Distribution
The dataset is structured for ease of use, typically provided in a CSV format. It comprises 605 individual records, each representing anonymised account and review data. While the exact file size is not specified, its structure is designed for straightforward integration into data analysis and machine learning workflows.
Usage
This dataset is ideally suited for data science and analytics professionals, especially those focused on:
- Developing and testing machine learning models for binary classification of fake versus real reviews.
- Natural Language Processing (NLP) research related to review authenticity.
- Investigating patterns of deceptive accounts and reviews on online platforms.
- Applications in fraud detection and digital trust initiatives.
- Academic research in areas such as applied sciences and artificial intelligence.
Coverage
The data collection period predates May 2023, the publication date of the associated research. While the dataset is classified with a global region coverage, it includes a
name_score
column that specifically reflects the commonness of names in the Polish language, indicating a potential focus or origin from Polish-speaking contexts. The dataset encompasses anonymised attributes of Google Maps accounts and their review behaviour.License
CC By 4.0
Who Can Use It
This dataset is primarily intended for:
- Researchers and academics studying online fraud, social media analysis, and machine learning applications in authenticity detection.
- Data scientists and AI/ML engineers looking to build or refine models for identifying fake reviews.
- Organisations interested in protecting brand reputation or improving trust on their platforms.
- Students undertaking projects in data analysis, classification, or NLP related to online reviews.
Dataset Name Suggestions
- GMR-PL Fake Reviews Dataset
- Google Maps Account Authenticity Data
- Google Maps Review Classification Data
- Polish Google Maps Fake Reviews
- Online Review Authenticity Dataset
Attributes
Original Data Source: GMR-PL Fake reviews dataset