Opendatabay APP

Isekai Light Novel Text Dataset

Entertainment & Media Consumption

Tags and Keywords

Text

NLP

Anime

Manga

Multiclass

Classification

Multilabel

Free

About

This dataset contains titles and descriptions of Isekai light novels, collected to support the development of AI models that generate similar content. Inspired by the idea of generating fictional light novel titles, it provides text data for training language models and for exploring genre classification within fantasy and alternate-world themes. The data was scraped from novelupdates.com and filtered to light novels tagged "Fantasy World" and "Alternate World".

Columns

  • Row #: A unique identifier for each entry.
  • Titles: The title of the light novel, provided as a string.
  • Descriptions: The synopsis or blurb of the light novel, also provided as a string.
  • Genres: A list of genres associated with each light novel, such as 'Action', 'Adventure', 'Fantasy', 'Harem', 'Romance', and 'Shounen'.
  • Links: URLs related to the light novel entries.
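A minimal sketch of reading these columns with Python's standard library follows. It assumes the Genres column is stored as a Python-style list literal inside a CSV cell, which the listing does not confirm. The sample rows, the example.com links, and the helper names parse_genres and load_rows are invented placeholders, not part of the dataset.

```python
import ast
import csv
import io

# Hypothetical stand-in for the downloaded file. The header names follow the
# column list above; the rows are invented placeholders, not real entries.
SAMPLE_CSV = '''Row #,Titles,Descriptions,Genres,Links
0,Placeholder Title A,Placeholder synopsis A.,"['Action', 'Fantasy']",https://example.com/a
1,Placeholder Title B,Placeholder synopsis B.,"['Romance']",https://example.com/b
'''

def parse_genres(raw):
    """Turn a serialized genre cell into a list of strings."""
    try:
        value = ast.literal_eval(raw)
    except (ValueError, SyntaxError):
        # Assumption: if the cell is not a list literal, read it as
        # comma-separated text instead.
        return [g.strip() for g in raw.split(",") if g.strip()]
    if isinstance(value, (list, tuple)):
        return [str(g) for g in value]
    return [str(value)]

def load_rows(handle):
    """Read every record and replace the raw Genres cell with a list."""
    rows = []
    for record in csv.DictReader(handle):
        record["Genres"] = parse_genres(record["Genres"])
        rows.append(record)
    return rows

rows = load_rows(io.StringIO(SAMPLE_CSV))
```

To read the real file, pass an open file handle (opened with newline="" and an explicit encoding) to load_rows in place of the in-memory sample.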

Distribution

The dataset contains 1366 unique entries, each with a title, a description, and associated genres. The exact file format for distribution is not specified, although data files are typically provided in CSV format.

Usage

This dataset is ideal for a variety of applications, including:
  • Generating new light novel titles or descriptions using AI models.
  • Developing machine learning models to predict novel genres from titles and descriptions, framed as a multi-class task or, because each novel can carry several genres, a multi-label task.
  • Fine-tuning language models such as GPT-2 and comparing the generated text with that of other models.
  • Exploring techniques for text generation and natural language processing within creative writing contexts.
  • Conducting analyses on common themes, keywords, and stylistic elements in Isekai light novels.
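For the genre prediction and theme analysis uses above, one possible first step is to turn each novel's genre list into a multi-hot target vector and to count how often each genre appears. The sketch below uses only the standard library. The genre lists are invented placeholders and the function names build_vocab and multi_hot are hypothetical.

```python
from collections import Counter

# Invented placeholder genre lists, one per novel; real values would come
# from the Genres column.
genre_lists = [
    ["Action", "Fantasy"],
    ["Romance"],
    ["Fantasy", "Romance"],
]

def build_vocab(lists):
    """Collect every distinct genre in a stable, sorted order."""
    return sorted({genre for genres in lists for genre in genres})

def multi_hot(genres, vocab):
    """Encode one novel's genres as a 0/1 vector aligned with vocab."""
    present = set(genres)
    return [1 if genre in present else 0 for genre in vocab]

vocab = build_vocab(genre_lists)
targets = [multi_hot(genres, vocab) for genres in genre_lists]

# Genre frequencies, useful for spotting rare labels before training.
genre_counts = Counter(genre for genres in genre_lists for genre in genres)
```

Each row of targets can serve as the label vector for a multi-label classifier, with one independent yes/no decision per genre.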

Coverage

The dataset's coverage is global, drawing information from novelupdates.com. It covers only light novels tagged "Fantasy World" and "Alternate World", which keeps the thematic focus on the Isekai genre. The data was collected with a Python script using selenium and bs4 (Beautiful Soup). Specific time ranges for the original novels are not provided; the dataset itself was listed in 2025. Filtering out very short titles is recommended for better results in generation tasks.
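The recommendation to drop very short titles could be applied along these lines. The listing gives no cutoff, so the three-word threshold, the function name filter_short_titles, and the sample titles are all assumptions chosen for illustration.

```python
def filter_short_titles(titles, min_words=3):
    """Keep titles with at least min_words whitespace-separated words.

    The default of 3 is an illustrative assumption, not a value taken
    from the dataset listing.
    """
    return [title for title in titles if len(title.split()) >= min_words]

# Invented placeholder titles.
sample_titles = [
    "Reborn",
    "A Placeholder Title About Another World",
    "Second Life",
]
kept = filter_short_titles(sample_titles)
```

A character-count threshold would work just as well; word count is used here only because it is easy to reason about.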

License

CC0

Who Can Use It

This dataset is suitable for:
  • Researchers and academics focusing on natural language processing, text generation, and machine learning.
  • AI developers creating content generation tools or genre classification systems.
  • Data scientists seeking to explore and analyse structured text data from the entertainment sector.
  • Content creators and writers interested in leveraging AI to inspire new stories or analyse popular literary trends.
  • Students learning about data scraping, text analysis, and AI model training.

Dataset Name Suggestions

  • Isekai Light Novel Text Dataset
  • Anime Novel Titles & Descriptions
  • Fantasy World Novelisation Data
  • Light Novel AI Generation Corpus
  • Novelupdates Isekai Scraped Data

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

24/06/2025

REGION

GLOBAL

UDQS QUALITY

5 / 5

VERSION

1.0
