Opendatabay APP

Time Series Sales Forecast Practice

Retail & Consumer Behavior

Tags and Keywords

Sales

Time

Prediction

Stores

Products

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Time Series Sales Forecast Practice Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset provides synthetic time series data specifically designed for individuals looking to enhance their time series analysis skills. It simulates realistic sales data, incorporating various features such as long-term trends, yearly seasonality patterns, and weekday/weekend effects, along with some noise. The aim is to facilitate practice in predicting the 'number sold' feature. The data models weak correlations between products and stores, offering a practical scenario for forecasting challenges.

Columns

  • Date: Represents the specific date in datetime format.
  • store: Identifies the unique store.
  • product: Identifies the unique product.
  • number_sold: Indicates the quantity of an item sold.

Distribution

The dataset is provided as synthetic time series data, split into two main files: train.csv and test.csv. The test.csv file has a size of 487.83 KB. It is typical for such data to be in CSV format, although not explicitly stated for both files here. The test.csv contains 25.6k valid records across all its columns. Specific row counts for the train.csv file are not detailed in the provided information. The dataset features data for 7 unique stores and 10 unique products, and importantly, contains no null values.

Usage

This dataset is ideally suited for anyone wishing to practice and refine their time series forecasting capabilities. It can be used for building predictive models, particularly for sales forecasting. Users can employ metrics such as the Mean Absolute Percentage Error (MAPE) for assessing their solutions and comparing them with others.

Coverage

The dataset spans a 10-year period from 2010 to 2019. The train.csv file covers the years 2010 to 2018, while the test.csv file exclusively covers data for 2019. The dataset includes information for 7 distinct stores and 10 distinct products. There are no explicit geographic or demographic details provided, as it is a synthetic dataset.

License

CC0: Public Domain

Who Can Use It

This dataset is intended for:
  • Data Science Learners and Practitioners: To practice and improve their skills in time series analysis and forecasting.
  • Students and Researchers: For academic exercises, case studies, and experimenting with various time series models.
  • Machine Learning Engineers: For developing and testing predictive models for sales or demand forecasting in a controlled, synthetic environment.

Dataset Name Suggestions

  • Synthetic Sales Time Series
  • Time Series Sales Forecast Practice
  • Retail Sales Prediction Dataset
  • Simulated Sales Data for Forecasting
  • Sales Activity Time Series

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

13/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format