Opendatabay APP

KDrama Content Analysis Dataset

Entertainment & Media Consumption

Tags and Keywords

Movies

Nlp

Recommender

Popular

Korea

Trusted By
Trusted by company1Trusted by company2Trusted by company3
KDrama Content Analysis Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset features the top 250 Korean Dramas, as ranked on the MyDramaList website. It is provided in a CSV file format and is suitable for various data science projects. The collection addresses limitations found in existing datasets, which often had fewer titles (e.g., up to 100) and lacked essential features such as synopsis, tags, director names, cast names, and production company details. It was originally compiled to support the development of a content-based recommender system for Korean Dramas.

Columns

The dataset contains 17 columns in total, with the following key columns and their descriptions:
  • Name: The title of the K-drama.
  • Aired Date: The period from when the drama first aired to when it concluded.
  • Year of release: The specific year the drama was initially released. This ranges from 2003 to 2022.
  • Original Network: The cable television channel or over-the-top (OTT) platform where the drama first premiered. Notable networks include tvN (20%) and SBS (12%).
  • Aired On: The days of the week on which the show was broadcast during its run. For example, Wednesday and Thursday (16%) or Monday and Tuesday (14%).
  • Number of Episodes: The total count of episodes for the K-drama.
  • Duration: The length of each episode, specified in hours and minutes. Common durations include 1 hour 10 minutes (22%) and 60 minutes (16%).
  • Content Rating: The recommended audience rating for the drama. 15+ (Teens 15 or older) accounts for 86%, while 18+ (Restricted, due to violence & profanity) is 8%.
  • Rating: The drama's rating at the time the dataset was compiled, based on user votes from the MyDramaList website. Ratings typically range from 8.3 to 9.2.
  • Synopsis: A brief overview or plot summary of the K-drama.

Distribution

The dataset is provided as a CSV file, containing 251 rows and 17 columns. The data consists primarily of textual information. There are 250 unique drama titles and unique synopses. Key distributions within the dataset include:
  • Most K-dramas were released between 2018 and 2022, with 72 titles released from 2018.20 - 2020.10 and 71 titles from 2020.10 - 2022.00.
  • A significant portion of dramas (68%) were aired on networks other than tvN or SBS.
  • About 70% of dramas aired on days other than Monday/Tuesday or Wednesday/Thursday.
  • Many episode durations are outside the 1 hr. 10 min. or 60 min. categories (62%).
  • A small percentage (6%) of dramas have content ratings other than 15+ or 18+.

Usage

This dataset is highly valuable for various applications and use cases, including:
  • Developing content-based recommender systems for Korean Dramas.
  • Performing Natural Language Processing (NLP) tasks on drama synopses and descriptions.
  • Conducting market research on popular culture and entertainment consumption trends in Korea.
  • Analysing audience preferences and content rating impact on drama popularity.
  • Creating data visualisations to explore trends in K-drama production and reception.

Coverage

The dataset focuses on Korean Dramas, covering a global region. The time range for drama releases spans from 2003 to 2022. Demographic scope is primarily defined by the content ratings (e.g., 15+, 18+) and the user ratings from the MyDramaList website, reflecting a diverse audience base interested in K-dramas.

License

CCO

Who Can Use It

This dataset is ideal for:
  • Data Scientists working on machine learning models, particularly for recommendation engines.
  • Machine Learning Engineers looking for rich textual data for NLP applications.
  • Academic Researchers studying media trends, popular culture, or entertainment analytics.
  • Students undertaking projects in data analysis, data science, or AI.
  • Developers aiming to build applications that involve K-drama content.

Dataset Name Suggestions

  • Top 250 Korean Dramas from MyDramaList
  • MyDramaList K-Drama Ratings and Synopsis
  • Korean Drama Popularity Dataset (2003-2022)
  • KDrama Content Analysis Dataset
  • Premier Korean Dramas Data Compendium

Attributes

Listing Stats

VIEWS

2

DOWNLOADS

0

LISTED

05/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free