BTS Discography & Lyrics Data
Free
About
This dataset provides track metadata and English-translated lyrics for songs by BTS, covering both group tracks and solo songs by individual members. The data was gathered primarily from Genius, Big Hit Entertainment, and Spotify, and it supports analysis of the group's discography, lyrical content, and audio characteristics.
Columns
- id: A unique identifier for each track.
- album_title: The title of the album.
- eng_album_title: The title of the album without non-English characters.
- album_rd: The release date of the album, presented in ISO format (YYYY-MM-DD).
- album_seq: The sequence number of the track within its album.
- track_title: The title of the track.
- raw_track_title: The title of the track, including non-English characters.
- eng_track_title: The title of the track without non-English characters.
- lyrics: The English-translated lyrics of the track, sourced from Genius unless otherwise credited.
- hidden_track: A boolean indicator specifying if the track is a hidden track (true/false).
- remix: A boolean indicator specifying if the track is a remix (true/false).
- featured: Indicates any artists featured in the track.
- performed_by: Indicates which BTS member(s) or the group performed the track.
- repackaged: A boolean indicator specifying if the track first appeared on an earlier album (true/false).
- lang: The language of the track, specified using ISO 639-2 (alpha 3) codes.
- has_full_ver: A boolean indicator specifying if the track has a full version (true/false).
- is_alt_lang_ver: A boolean indicator specifying if the track is an alternate language version of a previously existing track (true/false).
- spotify_album_id: Spotify's unique base-62 identifier for the album.
- spotify_track_duration_ms: The track's duration in milliseconds.
- spotify_track_id: Spotify's unique base-62 identifier for the track.
- spotify_track_danceability: A measure describing how suitable a track is for dancing.
- spotify_track_energy: A perceptual measure representing the intensity and activity of a track.
- spotify_track_key: The key the track is in, given as an integer in standard Pitch Class notation (for example, 0 = C, 1 = C♯/D♭, 2 = D).
- spotify_track_loudness: The overall loudness of a track in decibels (dB).
- spotify_track_mode: Indicates the modality of a track; in Spotify's encoding, 1 is major and 0 is minor.
- spotify_track_speechiness: A measure of the presence of spoken words in a track.
- spotify_track_acousticness: A confidence measure indicating whether the track is acoustic.
- spotify_track_instrumentalness: A prediction of whether a track contains no vocals.
- spotify_track_liveness: A measure of the likelihood that an audience is present in the recording.
- spotify_track_valence: Describes the musical positiveness conveyed by a track.
- spotify_track_tempo: The overall estimated tempo of a track in beats per minute (BPM).
- spotify_track_time_signature: The estimated time signature, representing the number of beats per bar or measure.
- eng_lyrics_source_url: The URL of the English-translated lyrics, given only when the translation was not retrieved from Genius.
- eng_lyrics_credits: Any required credits for the English-translated lyrics, given only when the translation was not retrieved from Genius.
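The column descriptions above imply a few type conversions when reading the CSV: ISO dates, true/false flags, and integer durations. The following is a minimal sketch of that conversion using only the Python standard library. The two sample rows are made up for illustration, and the exact spelling of boolean values in the real file is an assumption, so the parser accepts any casing of "true" and "false".

```python
import csv
import io
from datetime import date

# Hypothetical sample rows; the titles and values are invented for illustration.
SAMPLE_CSV = """id,album_rd,track_title,remix,hidden_track,lang,spotify_track_duration_ms
1,2013-06-12,Example Track A,false,false,kor,210000
2,2014-08-20,Example Track B,true,false,jpn,185500
"""

# Subset of the boolean columns listed above, enough for the sample.
BOOL_COLUMNS = ("remix", "hidden_track")


def parse_bool(value):
    """Map 'true'/'false' text (any casing) to a Python bool."""
    text = value.strip().lower()
    if text not in ("true", "false"):
        raise ValueError(f"unexpected boolean value: {value!r}")
    return text == "true"


def parse_row(row):
    """Return a copy of the row with dates, booleans, and durations typed."""
    typed = dict(row)
    typed["album_rd"] = date.fromisoformat(row["album_rd"])
    typed["spotify_track_duration_ms"] = int(row["spotify_track_duration_ms"])
    for column in BOOL_COLUMNS:
        typed[column] = parse_bool(row[column])
    return typed


def load_tracks(handle):
    """Read every row from an open CSV stream and convert its types."""
    return [parse_row(row) for row in csv.DictReader(handle)]


tracks = load_tracks(io.StringIO(SAMPLE_CSV))
```

With the real file, `load_tracks` would take an open file handle instead of the in-memory sample. Rows with missing Spotify values would need extra handling that this sketch does not attempt.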
Distribution
The dataset is provided in CSV format and includes 34 columns. It contains data for 444 unique tracks, encompassing a wide range of songs by BTS as a group and individual members. The file size is 1.01 MB.
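One way to confirm that a downloaded copy matches the figures above is to count its columns, rows, and distinct ids. The sketch below does this with the standard library; the function name `summarize_shape` and the three-column sample are hypothetical and stand in for the real file.

```python
import csv
import io

# Hypothetical sample with a deliberately duplicated id.
SAMPLE_CSV = (
    "id,track_title,lang\n"
    "1,Example A,kor\n"
    "2,Example B,jpn\n"
    "2,Example B,jpn\n"
)


def summarize_shape(handle):
    """Return (column_count, row_count, unique_id_count) for a CSV stream."""
    reader = csv.DictReader(handle)
    ids = [row["id"] for row in reader]
    columns = reader.fieldnames or []  # None when the stream is empty
    return len(columns), len(ids), len(set(ids))


shape = summarize_shape(io.StringIO(SAMPLE_CSV))
```

If the description above holds, running the same function on the full file should report 34 columns and 444 unique ids. A row count higher than the unique id count would point to duplicated rows.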
Usage
This dataset suits music analysis, lyrical studies, and popular culture research. It can be used to explore trends in BTS's discography, analyse the sentiment and themes of their English-translated lyrics, and investigate the Spotify audio features of their tracks. It supports both quantitative and qualitative studies of the group's musical evolution and impact.
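As an illustration of the trend analysis mentioned above, the sketch below averages one Spotify audio feature per release year. It assumes the `album_rd` and `spotify_track_valence` columns described earlier; the function name `mean_feature_by_year` and the sample values are hypothetical and not taken from the dataset.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical sample; the valence values are invented for illustration.
SAMPLE_CSV = """album_rd,spotify_track_valence
2013-06-12,0.5
2013-09-11,0.25
2020-02-21,0.75
2020-11-20,
"""


def mean_feature_by_year(handle, feature):
    """Average a numeric feature column per release year, skipping blanks."""
    by_year = defaultdict(list)
    for row in csv.DictReader(handle):
        value = (row[feature] or "").strip()
        if not value:  # assume a blank cell means no Spotify data for the track
            continue
        year = int(row["album_rd"][:4])  # ISO dates start with the year
        by_year[year].append(float(value))
    return {year: mean(values) for year, values in sorted(by_year.items())}


valence_by_year = mean_feature_by_year(io.StringIO(SAMPLE_CSV), "spotify_track_valence")
```

The same function works for any other numeric column, such as `spotify_track_energy` or `spotify_track_tempo`, by changing the `feature` argument.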
Coverage
The dataset covers BTS songs released from 12th June 2013 onward, including estimated releases up to 18th July 2025. It includes songs performed by BTS as a group and solo songs by individual members. All lyrics are English translations. Most tracks are in Korean (66%) or Japanese (24%); other languages account for the remaining 10%.
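The language percentages quoted above can be recomputed from the `lang` column. The sketch below shows one way to do so; the function name `language_shares` is hypothetical, and the sample codes are arbitrary, so their proportions do not reflect the dataset.

```python
from collections import Counter


def language_shares(codes):
    """Return each language's share of tracks as a percentage, largest first."""
    counts = Counter(code.strip().lower() for code in codes if code.strip())
    total = sum(counts.values())
    return {lang: round(100 * n / total, 1) for lang, n in counts.most_common()}


# Arbitrary ISO 639-2 codes standing in for the real `lang` column.
sample_codes = ["kor", "kor", "kor", "jpn", "eng"]
shares = language_shares(sample_codes)
```

On the full dataset, the Korean and Japanese shares should come out close to the figures given above if the `lang` column is populated for every track.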
License
CC0: Public Domain
Who Can Use It
- Music analysts and researchers interested in K-Pop trends and musical characteristics.
- Data scientists and students looking for real-world datasets for text analysis, audio feature analysis, or time-series studies.
- Fans and enthusiasts eager to delve deeper into the statistics and lyrical content of BTS's music.
- Developers building applications related to music discovery, recommendation, or lyrical content.
Dataset Name Suggestions
- BTS Discography & Lyrics Data
- BTS Spotify Audio Features & English Lyrics
- K-Pop BTS Song Metrics
- BTS Music Data Collection
- Global K-Pop: BTS Song Analysis
Attributes
Original Data Source: BTS Discography & Lyrics Data