Opendatabay APP

Retail Diamond Price Factors Dataset

Retail & Consumer Behavior

Tags and Keywords

Diamonds

Jewellery

Pricing

Retail

Analytics

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Retail Diamond Price Factors Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

A collection of data designed to demystify the value of the 4 Cs—cut, color, clarity, and carat—aims to make the process of buying a diamond less frustrating and expensive. It contains approximately 119,000 records for both natural and lab-created diamonds, scraped from brilliantearth.com. This data was collected to provide transparency into the diamond purchasing experience.

Columns

  • id: A unique identification number for the diamond provided by Brilliant Earth.
  • url: The URL for the diamond's specific details page.
  • shape: The external geometric appearance of a diamond (e.g., Round, Oval).
  • price: The price of the diamond in U.S. dollars.
  • carat: The weight of the diamond.
  • cut: The quality of a diamond's facets, symmetry, and reflective qualities (e.g., Super Ideal, Ideal).
  • color: The lack of colour visible within a diamond, based on the GIA grade scale.
  • clarity: The visibility of microscopic imperfections within a diamond (e.g., VS1, VS2).
  • report: The diamond grading report provided by an independent gemology lab (e.g., GIA, IGI).
  • type: Indicates whether the diamond is natural or lab-created.
  • date_fetched: The date the data was collected.

Distribution

The data is provided in a single CSV file named diamonds_dataset.csv with a size of 16.77 MB. It consists of approximately 119,000 rows and 11 columns.

Usage

This data is ideal for analysing the factors that influence diamond pricing. It can be used to build predictive models to estimate diamond values based on their characteristics. Researchers and analysts can also use it to compare the market differences between natural and lab-created diamonds.

Coverage

The data was collected from a single online source, brilliantearth.com, on a single date: 29 November 2020. It covers diamonds available for purchase in U.S. dollars at that time. There is no specific geographic or demographic scope beyond the customers of this international retailer.

License

Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)

Who Can Use It

  • Data Scientists: To build machine learning models for price prediction and feature importance analysis.
  • Jewellery Industry Analysts: To study market trends, pricing strategies, and the valuation differences between natural and lab-grown diamonds.
  • Consumers and Shoppers: To gain insights into the 4 Cs and make more informed purchasing decisions.
  • Academics: For research into commodity pricing and consumer retail markets.

Dataset Name Suggestions

  • Brilliant Earth Diamond Pricing Data
  • Natural and Lab-Grown Diamond Characteristics
  • Diamond 4Cs Market Value Analysis
  • Retail Diamond Price Factors Dataset

Attributes

Listing Stats

VIEWS

3

DOWNLOADS

0

LISTED

17/09/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format