Opendatabay APP

Webtoon Metadata Collection Dataset

Entertainment & Media Consumption

Tags and Keywords

Arts

Anime

Nlp

Comics

Recommender

Korea

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Webtoon Metadata Collection Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This webtoon dataset offers a rich collection of information on digital comics, primarily sourced from Naver Webtoon and Naver Best Challenge. It includes over 2,100 unique webtoons serialised on Naver Webtoon and over 3,100 records from Naver Best Challenge. The dataset was last updated on 31 December 2022, with new webtoons added and duplicate entries fixed for the challenge dataset. It provides a detailed look into entertainment and media consumption, making it valuable for a variety of analytical and machine learning applications.

Columns

The primary webtoon dataset (e.g., naver.csv) includes the following columns:
  • id: A unique identifier for each webtoon.
  • title: The name of the webtoon.
  • author: The writer of the webtoon.
  • genre: The style or category of the webtoon.
  • description: An introduction or summary of the webtoon's content.
  • rating: An average rating for the webtoon, typically out of 10.
  • date: The most recent update date for the webtoon.
  • completed: Indicates the completion status of the webtoon (e.g., true/false).
  • age: The recommended age for viewing the webtoon.
  • free: Denotes the availability of a free service event, such as "wait for free" (기다리면 무료).
  • link: A direct link to the webtoon.
The Naver Best Challenge dataset (naver_challenge.csv) contains similar columns but with some variations, including summary, format, serialize (whether officially serialised on Naver), and potenup (if chosen as potential up).

Distribution

This dataset is typically provided in a CSV format and is structured into two main files: naver.csv for webtoons serialised on Naver Webtoon, and naver_challenge.csv for webtoons from Naver Best Challenge. The collection features more than 1,850 cartoons in total, with over 2,100 unique webtoons in the main dataset and over 3,100 records in the challenge dataset. Specific numbers for total rows/records are detailed for various categories like label counts (e.g., 15.4k to 804k for some labels) and boolean counts for completion status (e.g., 1,406 true, 694 false).

Usage

This dataset is ideal for:
  • Building recommender systems for digital comics and webtoons.
  • Natural Language Processing (NLP) tasks, such as content analysis of descriptions and titles.
  • Machine learning and artificial intelligence applications, including trend prediction and user segmentation.
  • Analysing patterns in entertainment and media consumption.
  • Academic research into digital media trends and user engagement with webtoon platforms.

Coverage

The dataset's coverage is primarily geographic to Korea, focusing on content from the Naver Webtoon platform and its associated Best Challenge section. The time range for updates spans from 3 August 2006 to 31 December 2022. Demographic scope is addressed through the recommended age classifications, with significant proportions for "all ages" (전체연령가) and "15+ years" (15세 이용가) categories.

License

CC-BY-SA

Who Can Use It

This dataset is suitable for:
  • Data scientists looking to develop predictive models or data-driven insights in the entertainment sector.
  • Machine learning engineers interested in creating recommendation engines or content classification systems for webtoons.
  • Researchers studying digital media trends, cultural consumption patterns in Korea, or the evolution of webtoon content.
  • Students and hobbyists keen to explore and analyse large datasets related to popular culture and media.

Dataset Name Suggestions

  • Korean Webtoon Data
  • Naver Webtoon & Challenge Data
  • Digital Comics Analytics Dataset
  • Webtoon Metadata Collection

Attributes

Original Data Source: Webtoon Dataset in Korean

Listing Stats

VIEWS

2

DOWNLOADS

2

LISTED

08/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free