Isekai Light Novel Text Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset contains titles and descriptions of Isekai anime light novels, collected to facilitate the development of AI models for generating similar content. Inspired by the concept of fictional light novel titles, it provides valuable text data for training language models and exploring genre classification within the fantasy and alternate world themes. The data was meticulously scraped from novelupdates.com, specifically filtering for light novels tagged with "Fantasy World" and "Alternate World".
Columns
- Row #: A unique identifier for each entry.
- Titles: The title of the light novel, provided as a string.
- Descriptions: The synopsis or blurb of the light novel, also provided as a string.
- Genres: A list of genres associated with each light novel, such as 'Action', 'Adventure', 'Fantasy', 'Harem', 'Romance', and 'Shounen'.
- Links: URLs related to the light novel entries.
Distribution
The dataset is structured with 1366 unique entries, with each entry containing a title, description, and associated genres. While the exact file format for distribution is not specified, data files are typically provided in CSV format. Specific numbers for rows and records are available, with 1366 records across the key columns.
Usage
This dataset is ideal for a variety of applications, including:
- Generating new light novel titles or descriptions using AI models.
- Developing machine learning models to predict novel genres based on titles and descriptions, facilitating multi-class classification tasks.
- Retraining large language models like GPT-2 to observe differences in generated text compared to other models.
- Exploring techniques for text generation and natural language processing within creative writing contexts.
- Conducting analyses on common themes, keywords, and stylistic elements in Isekai light novels.
Coverage
The dataset's coverage is global, drawing information from novelupdates.com. It focuses specifically on light novels tagged under "Fantasy World" and "Alternate World," ensuring a thematic focus on the Isekai genre. The data collection process involved a Python script utilising selenium and bs4. While specific time ranges for the original novels are not provided, the dataset itself was listed in 2025. It is recommended to filter out very short titles for optimal results in generation tasks.
License
CC0
Who Can Use It
This dataset is suitable for:
- Researchers and academics focusing on natural language processing, text generation, and machine learning.
- AI developers creating content generation tools or genre classification systems.
- Data scientists seeking to explore and analyse structured text data from the entertainment sector.
- Content creators and writers interested in leveraging AI to inspire new stories or analyse popular literary trends.
- Students learning about data scraping, text analysis, and AI model training.
Dataset Name Suggestions
- Isekai Light Novel Text Dataset
- Anime Novel Titles & Descriptions
- Fantasy World Novelisation Data
- Light Novel AI Generation Corpus
- Novelupdates Isekai Scraped Data
Attributes
Original Data Source: Isekai Anime Light Novel Titles and Descriptions