2021 Global Spotify Streamed Songs
Free
About
This dataset contains the top 100 most streamed songs from Spotify in 2021, complete with various audio features extracted directly from the Spotify API. It was originally created to serve as the basis for a recommendation system. This collection offers a clear overview of the audio characteristics of popular music during that year, providing valuable insights for music analysis and data science projects.
Columns
- id: A unique identifier for each track.
- name: The title of the song.
- duration: The length of the track. The unit is not stated; the observed range (approximately 0.73 to 10.2) suggests minutes.
- energy: A perceptual measure representing the intensity and activity of a track, with higher values indicating more energetic songs (mean: 0.59).
- key: The key of the track as an integer in pitch class notation, where 0 represents C, 1 represents C#, and so on up to 11 for B (mean: 5.05).
- loudness: The overall loudness of a track in decibels (dB), with values typically ranging from -60 to 0 dB (mean: -6.58).
- mode: Indicates the modality (major or minor) of a track, with 0 for minor and 1 for major (mean: 0.56).
- speechiness: Detects the presence of spoken words in a track, with values above 0.66 suggesting spoken word tracks (e.g., podcasts), values between 0.33 and 0.66 indicating music and speech, and values below 0.33 indicating music (mean: 0.08).
- acousticness: A measure of how acoustic a track is, with higher values indicating a higher probability the track is acoustic (mean: 0.31).
- instrumentalness: Predicts whether a track contains no vocals, with higher values representing a higher likelihood of no vocals (mean: 0.07).
- liveness: Detects the presence of an audience in the recording, with values above 0.8 suggesting a live recording (mean: 0.16).
- valence: A measure from 0.0 to 1.0 describing the musical positiveness conveyed by a track, with higher values indicating more positive (e.g., happy, cheerful) sounds (mean: 0.52).
- tempo: The estimated tempo of the track in beats per minute (BPM) (mean: 122).
- danceability: Describes how suitable a track is for dancing based on a combination of musical elements like tempo, rhythm stability, beat strength, and overall regularity (mean: 0.65).
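The integer and threshold encodings in the column list can be turned into readable labels with a few small helpers. The following is a minimal sketch, not part of the dataset: the function names `key_name` and `speechiness_label` are hypothetical, sharps are used for all accidentals, and treating any key outside 0 to 11 as "unknown" is an assumption in case the file contains a sentinel value for undetected keys.

```python
# Pitch classes in the order used by the `key` column (0 = C, 1 = C#, ...).
PITCH_CLASSES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def key_name(key: int, mode: int) -> str:
    """Combine the `key` and `mode` columns into a label such as 'C# minor'."""
    if not 0 <= key <= 11:
        # Assumption: anything outside 0..11 means no key was detected.
        return "unknown"
    modality = "major" if mode == 1 else "minor"
    return f"{PITCH_CLASSES[key]} {modality}"


def speechiness_label(value: float) -> str:
    """Bucket `speechiness` using the 0.33 and 0.66 thresholds described above."""
    if value > 0.66:
        return "spoken word"
    if value >= 0.33:
        return "music and speech"
    return "music"


if __name__ == "__main__":
    print(key_name(5, 1))
    print(speechiness_label(0.08))
```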
Distribution
The dataset is provided as a CSV file named top 100 streamed_songs.csv. It has a file size of 12.23 kB and contains 14 distinct columns. The dataset comprises 100 records, each representing one of the top streamed songs.
Usage
This dataset is ideal for:
- Developing and testing music recommendation systems.
- Analysing trends in popular music based on audio features.
- Studying the characteristics of highly streamed songs in 2021.
- Exploring the relationship between audio features and song popularity.
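For the recommendation use case, one simple starting point is a content-based approach: scale each audio feature to a common range and rank tracks by cosine similarity to a chosen song. The sketch below uses only the Python standard library. It assumes the CSV header matches the column names listed under Columns; the function names and the choice of features are illustrative, not the method used by the dataset's author.

```python
import csv
import math

# Feature columns as named in the Columns section; adjust if the header differs.
FEATURES = ["energy", "loudness", "speechiness", "acousticness",
            "instrumentalness", "liveness", "valence", "tempo", "danceability"]


def load_tracks(path):
    """Read the CSV into a list of dicts, converting feature columns to float."""
    with open(path, newline="", encoding="utf-8") as f:
        rows = list(csv.DictReader(f))
    for row in rows:
        for col in FEATURES:
            row[col] = float(row[col])
    return rows


def scale_features(rows):
    """Min-max scale each feature to [0, 1] so loudness and tempo do not dominate."""
    bounds = {c: (min(r[c] for r in rows), max(r[c] for r in rows)) for c in FEATURES}
    vectors = {}
    for r in rows:
        vec = []
        for c in FEATURES:
            lo, hi = bounds[c]
            vec.append((r[c] - lo) / (hi - lo) if hi > lo else 0.0)
        vectors[r["id"]] = vec
    return vectors


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(rows, track_id, k=5):
    """Return up to k (name, similarity) pairs most similar to the given track id."""
    vectors = scale_features(rows)
    target = vectors[track_id]
    names = {r["id"]: r["name"] for r in rows}
    scored = [(cosine(target, vec), tid)
              for tid, vec in vectors.items() if tid != track_id]
    scored.sort(reverse=True)
    return [(names[tid], round(score, 3)) for score, tid in scored[:k]]
```

With the file downloaded, `rows = load_tracks("top 100 streamed_songs.csv")` followed by `recommend(rows, rows[0]["id"])` would list the five tracks closest to the first song. With only 100 records, the results are better suited to prototyping than to evaluating a recommender.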
Coverage
This dataset covers the 100 most streamed songs on Spotify in 2021. As the title indicates, the ranking reflects global streams on the platform; no geographic or demographic segmentation is provided.
License
CC0: Public Domain
Who Can Use It
This dataset is suitable for:
- Data Scientists and Machine Learning Engineers: For building and evaluating music recommendation models.
- Music Analysts and Researchers: To understand audio feature distributions and trends in popular music.
- Developers: Interested in working with Spotify API data or building music-related applications.
- Students and Educators: As a practical dataset for learning about data analysis, feature engineering, and recommendation systems.
Dataset Name Suggestions
- Spotify 2021 Top Tracks Audio Features
- Top 100 Spotify Songs 2021 (Audio Analysis)
- 2021 Global Spotify Streamed Songs
- Spotify API Top 100 (2021) Dataset
Attributes
Original Data Source: 2021 Global Spotify Streamed Songs