KDrama Content Analysis Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset features the top 250 Korean Dramas, as ranked on the MyDramaList website. It is provided in a CSV file format and is suitable for various data science projects. The collection addresses limitations found in existing datasets, which often had fewer titles (e.g., up to 100) and lacked essential features such as synopsis, tags, director names, cast names, and production company details. It was originally compiled to support the development of a content-based recommender system for Korean Dramas.
Columns
The dataset contains 17 columns in total, with the following key columns and their descriptions:
- Name: The title of the K-drama.
- Aired Date: The period from when the drama first aired to when it concluded.
- Year of release: The specific year the drama was initially released. This ranges from 2003 to 2022.
- Original Network: The cable television channel or over-the-top (OTT) platform where the drama first premiered. Notable networks include tvN (20%) and SBS (12%).
- Aired On: The days of the week on which the show was broadcast during its run. For example, Wednesday and Thursday (16%) or Monday and Tuesday (14%).
- Number of Episodes: The total count of episodes for the K-drama.
- Duration: The length of each episode, specified in hours and minutes. Common durations include 1 hour 10 minutes (22%) and 60 minutes (16%).
- Content Rating: The recommended audience rating for the drama. 15+ (Teens 15 or older) accounts for 86%, while 18+ (Restricted, due to violence & profanity) is 8%.
- Rating: The drama's rating at the time the dataset was compiled, based on user votes from the MyDramaList website. Ratings typically range from 8.3 to 9.2.
- Synopsis: A brief overview or plot summary of the K-drama.
Distribution
The dataset is provided as a CSV file, containing 251 rows and 17 columns. The data consists primarily of textual information. There are 250 unique drama titles and unique synopses.
Key distributions within the dataset include:
- Most K-dramas were released between 2018 and 2022, with 72 titles released from 2018.20 - 2020.10 and 71 titles from 2020.10 - 2022.00.
- A significant portion of dramas (68%) were aired on networks other than tvN or SBS.
- About 70% of dramas aired on days other than Monday/Tuesday or Wednesday/Thursday.
- Many episode durations are outside the 1 hr. 10 min. or 60 min. categories (62%).
- A small percentage (6%) of dramas have content ratings other than 15+ or 18+.
Usage
This dataset is highly valuable for various applications and use cases, including:
- Developing content-based recommender systems for Korean Dramas.
- Performing Natural Language Processing (NLP) tasks on drama synopses and descriptions.
- Conducting market research on popular culture and entertainment consumption trends in Korea.
- Analysing audience preferences and content rating impact on drama popularity.
- Creating data visualisations to explore trends in K-drama production and reception.
Coverage
The dataset focuses on Korean Dramas, covering a global region. The time range for drama releases spans from 2003 to 2022. Demographic scope is primarily defined by the content ratings (e.g., 15+, 18+) and the user ratings from the MyDramaList website, reflecting a diverse audience base interested in K-dramas.
License
CCO
Who Can Use It
This dataset is ideal for:
- Data Scientists working on machine learning models, particularly for recommendation engines.
- Machine Learning Engineers looking for rich textual data for NLP applications.
- Academic Researchers studying media trends, popular culture, or entertainment analytics.
- Students undertaking projects in data analysis, data science, or AI.
- Developers aiming to build applications that involve K-drama content.
Dataset Name Suggestions
- Top 250 Korean Dramas from MyDramaList
- MyDramaList K-Drama Ratings and Synopsis
- Korean Drama Popularity Dataset (2003-2022)
- KDrama Content Analysis Dataset
- Premier Korean Dramas Data Compendium
Attributes
Original Data Source: Top 250 Korean Dramas (KDrama) Dataset