Opendatabay APP

Predict Future Sales Translated Data

Retail & Consumer Behavior

Tags and Keywords

Sales

Retail

Kaggle

Forecasting

Translation

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Predict Future Sales Translated Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Essential English translations of the Russian-language files provided in the 'Predict Future Sales' Kaggle competition allow for accessible feature engineering and reference. The files cover supplemental information regarding shops, items, and item categories, enabling non-Russian speakers to analyse the relationships between specific products and store locations effectively without language barriers.

Columns

  • ID: An identifier representing a (Shop, Item) tuple within the test set.
  • shop_id: Unique identifier for a specific shop.
  • item_id: Unique identifier for a specific product.
  • item_name: The name of the item (translated to English).
  • shop_name: The name of the shop (translated to English).
  • item_category_name: The name of the item category (translated to English).
  • item_category_id: A unique identifier for each item category.

Distribution

The dataset consists of tabular data files in CSV format, specifically items.csv, shops.csv, and item_categories.csv. The item_categories.csv file is approximately 2.23 kB in size and contains 84 unique rows (categories). The data is static with an expected update frequency of 'Never'.

Usage

  • Kaggle Competition Entry: Essential for English-speaking participants in the 'Predict Future Sales' competition.
  • Feature Engineering: Facilitates the creation of new features based on English text descriptions.
  • Sales Forecasting: Building models to predict future sales figures using translated metadata.
  • Text Analysis: Comparing Russian and English descriptions for NLP practice.

Coverage

The data covers the retail context of the 'Predict Future Sales' competition. It specifically provides translations for items, shops, and categories originally presented in Russian. The item_categories.csv file indicates 100% validity across 84 unique values, with no missing data.

License

CC0: Public Domain

Who Can Use It

  • Data Scientists
  • Machine Learning Engineers
  • Kaggle Competitors
  • Retail Analysts
  • Students learning Time Series Analysis

Dataset Name Suggestions

  • Predict Future Sales Translated Data
  • English Translations for Kaggle Sales Competition
  • Retail Shops and Items (English Version)
  • Translated Russian Sales Metadata

Attributes

Original Data Source:

Listing Stats

VIEWS

4

DOWNLOADS

0

LISTED

08/12/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in ZIP Format