One Million Bandcamp Transactions
E-commerce & Online Transactions
Free
About
A collection of 1,000,000 recorded sales transactions originating from the Bandcamp platform’s sales feed. The data gives detailed visibility into e-commerce activity for independent music and merchandise, recording the selling artist, the item type (digital or physical), and several financial fields such as item price, amount paid, and currency. This slice covers a focused period of market activity, from 9 September to 2 October 2020, and was originally compiled as part of research for "The Chaos Bazaar."
Columns
The dataset contains 23 fields, capturing transactional and item-specific details. Notable columns include:
- _id: A unique identifier generated by combining the sale’s URL and its UTC timestamp.
- url: The web path to the sold item on the platform. This can be used to join this dataset to a dataset of Bandcamp items.
- artist_name: The name of the independent artist or creator associated with the sale (populated in 100% of records).
- album_title: The title of the album, when applicable (present in approximately 36% of records).
- item_type: Categorises the product sold: 'a' for digital albums (48% of sales), 'p' for physical items, and 't' for digital tracks (27% of sales).
- slug_type: Another designation for the object type ('a' for all albums, 'p' for merchandise, and 't' for tracks).
- utc_date: The UTC timestamp of the transaction.
- country_code: The two-letter country code of the buyer.
- country: The full name of the buyer's country (e.g., United States).
- item_price: The advertised price of the item in the seller's original currency.
- currency: The currency used by the seller (USD is the most frequent at 46%, followed by EUR at 26%).
- amount_paid: The final amount paid in the seller's currency.
- amount_paid_fmt: The paid amount formatted with the relevant currency symbol (e.g., $1, which is the most common single value at 8%).
- amount_paid_usd: The final amount paid by the buyer, standardised in US Dollars.
- amount_over_fmt: The voluntary amount, if any, paid over the base item price by the buyer (missing in 88% of records).
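The percentages quoted in the column list can be checked with a small helper. The following is a minimal sketch using only the Python standard library; the function name value_shares and the sample rows are illustrative assumptions (the rows are made up, not taken from the dataset), and empty strings are assumed to mean "missing".

```python
from collections import Counter

def value_shares(rows, column):
    """Return {value: share of rows} for one column.

    Empty strings and absent keys are counted together under None,
    on the assumption that an empty CSV cell means "missing".
    """
    counts = Counter(row.get(column) or None for row in rows)
    total = sum(counts.values())
    return {value: n / total for value, n in counts.items()} if total else {}

# Made-up sample rows for illustration only.
sample = [
    {"item_type": "a", "amount_over_fmt": ""},
    {"item_type": "a", "amount_over_fmt": "$1"},
    {"item_type": "t", "amount_over_fmt": ""},
    {"item_type": "p", "amount_over_fmt": ""},
]

type_shares = value_shares(sample, "item_type")
missing_over = value_shares(sample, "amount_over_fmt").get(None, 0.0)
```

Run against the full file, the same helper should yield figures comparable to those listed above, such as the share of 'a' versus 't' sales or the proportion of records with no amount_over_fmt.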
Distribution
The dataset comprises 1,000,000 individual sales records across 23 columns. The file, typically offered in CSV format, is approximately 294.57 MB. The data is static, so no updates are expected.
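At roughly 295 MB, the file can be streamed row by row rather than loaded into memory at once. The sketch below, using only the standard library, counts rows and lists column names in a single pass. The function name scan_csv is hypothetical, and the in-memory demo stands in for the real file, whose name and encoding are not stated here.

```python
import csv
import io

def scan_csv(handle):
    """Stream a CSV once and return (column_names, row_count).

    Rows are discarded as they are read, so memory use stays small.
    """
    reader = csv.DictReader(handle)
    row_count = sum(1 for _ in reader)
    return list(reader.fieldnames or []), row_count

# Stand-in for the real file; the three columns are a subset chosen for brevity.
demo = io.StringIO("_id,url,item_type\nx1,/album/a,a\nx2,/track/b,t\n")
columns, n_rows = scan_csv(demo)

# With the downloaded file (file name and UTF-8 encoding are assumptions):
# with open("bandcamp_sales.csv", newline="", encoding="utf-8") as f:
#     columns, n_rows = scan_csv(f)
```

On the full dataset, the expectation from the description is 23 column names and 1,000,000 rows.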
Usage
The data is highly suitable for several analytical applications, including:
- Market Analysis: Studying sales volume, pricing strategies, and geographical distribution of e-commerce purchases.
- Natural Language Processing (NLP): Analysing textual fields such as artist names and item descriptions for product characterisation.
- Forecasting Price and Demand: Utilising the time-series sales information to model future trends.
- Recommender System Development: Building models based on buyer purchase patterns.
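As one example of the market analysis use case, revenue can be totalled per buyer country from the amount_paid_usd column. This is a minimal sketch with made-up sample rows; the function name revenue_by_country is an assumption, and rows with a missing or non-numeric amount are simply skipped.

```python
from collections import defaultdict

def revenue_by_country(rows):
    """Sum amount_paid_usd per country, highest total first."""
    totals = defaultdict(float)
    for row in rows:
        try:
            country = row["country"]
            amount = float(row["amount_paid_usd"])
        except (KeyError, TypeError, ValueError):
            continue  # skip rows with a missing or non-numeric amount
        totals[country] += amount
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)

# Made-up sample rows for illustration only.
sample_sales = [
    {"country": "United States", "amount_paid_usd": "1.0"},
    {"country": "United States", "amount_paid_usd": "2.5"},
    {"country": "Germany", "amount_paid_usd": "4.0"},
    {"country": "Japan", "amount_paid_usd": ""},
]

ranking = revenue_by_country(sample_sales)
```

The same grouping pattern extends to item_type or currency by changing the key column.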
Coverage
The transactional records span a period from 9th September 2020 through to 2nd October 2020. Geographically, buyers originate from 186 unique countries. The United States accounts for the largest share of sales activity, representing 40% of the records. The scope covers physical merchandise, digital albums, and single digital tracks sold via the Bandcamp platform.
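The coverage figures above (date range, number of countries, share of the top country) can be recomputed from the records. The sketch below assumes that utc_date holds Unix epoch seconds, which this description does not confirm; if the column is stored as text dates instead, the parsing line would need to change. The function name coverage_summary and the sample rows are illustrative, and the input is assumed to be non-empty.

```python
from collections import Counter
from datetime import datetime, timezone

def coverage_summary(rows):
    """Summarise country spread and date range for a non-empty set of rows."""
    rows = list(rows)
    countries = Counter(row["country"] for row in rows)
    top_country, top_count = countries.most_common(1)[0]
    # ASSUMPTION: utc_date is Unix epoch seconds (not confirmed by the source).
    stamps = [float(row["utc_date"]) for row in rows]
    return {
        "unique_countries": len(countries),
        "top_country": top_country,
        "top_share": top_count / len(rows),
        "first_sale": datetime.fromtimestamp(min(stamps), tz=timezone.utc),
        "last_sale": datetime.fromtimestamp(max(stamps), tz=timezone.utc),
    }

# Made-up sample rows for illustration only.
sample_rows = [
    {"country": "United States", "utc_date": "1599609600"},
    {"country": "United States", "utc_date": "1600000000"},
    {"country": "Germany", "utc_date": "1601000000"},
    {"country": "Japan", "utc_date": "1601596800"},
]

summary = coverage_summary(sample_rows)
```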
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Analysts: For quantitative modelling of e-commerce and consumer behaviour.
- Music Industry Professionals: To gain insight into global independent music sales trends and popular item formats.
- Academic Researchers: Those studying online community economies, pricing dynamics, or digital goods distribution.
Dataset Name Suggestions
- Bandcamp Sales Transactions September 2020
- One Million Bandcamp Items Sold
- Independent Music E-Commerce Feed
Attributes
Original Data Source: One Million Bandcamp Transactions
