Harry Potter Fanfiction Metadata Archive
Product Reviews & Feedback
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This collection offers a detailed look at Harry Potter fanfiction scraped from FanFiction.net. It captures metadata for stories written over nearly two decades, intended for visual analysis and trend spotting. The resource focuses on story summaries, metrics, and demographics, rather than the full story text.
Columns
- Chapters: The number of chapters within the story, ranging from 1 up to 542.
- Favs: The number of users who have favourited the story, with a maximum recorded value of 27.9k.
- Follows: The count of users following the story, reaching a maximum of 19.6k.
- Published: The exact date the story was initially posted online.
- Reviews: The total number of reviews received by the story, up to 38.1k.
- Updated: The date the story was last modified or updated.
- Words: The story word count, which ranges significantly, with a maximum recorded value of 3.32 million words.
- author: The FanFiction.net username associated with the story creator.
- characters: Characters that feature prominently in the narrative.
- genre: The genre classification applied by the author (Romance is the most common at 14%).
- language: The language used in the story (English is the predominant language at 78%).
- rating: The alphabetical rating assigned by the author, as codified by FanFiction.net (T is the most frequent rating at 38%).
- story_link: The direct URL to the fanfiction story.
- synopsis: A summary or blurb written by the author.
- title: The title of the story.
- published_mmyy: The month and year of the original published date.
- pairing: The character pairings featured in the story (Draco M. and Hermione G. are the most common pairing cited).
Distribution
This product is typically delivered as a CSV data file, designated as
hpcleanvlarge1.csv. The file size is approximately 242.91 MB and it contains 17 distinct columns. The collection includes 648,000 records. The expected update schedule for this material is quarterly.Usage
Ideal applications for this data involve statistical analysis of fan culture and writing trends. Users can explore:
- Identifying the most popular character pairings within the fandom.
- Analysing the volume of fanfiction produced across various languages.
- Visualising trends in story creation and popularity since major book or film releases.
- Studying the correlation between story length (word count, chapter count) and engagement metrics (Favs, Follows, Reviews).
Coverage
The scope covers Harry Potter fanfiction entries scraped from FanFiction.net, focusing on content written between 2001 and 2019. Stories in all available languages are included, though English stories account for the majority of the data.
License
CC0: Public Domain
Who Can Use It
- Data Scientists and Analysts: For studying large-scale creative writing trends, distribution patterns, and popularity signals.
- Cultural Researchers: Investigating fandom evolution, demographic preferences, and the influence of source material milestones.
- Fan Community Developers: For creating informational dashboards and visualisations dedicated to the Harry Potter fandom.
Dataset Name Suggestions
- Harry Potter Fanfiction Metadata Archive
- HP Fanfic Trends 2001-2019
- FanFiction.net Harry Potter Story Metrics
- Structured Harry Potter Fanfic Blurbs
Attributes
Original Data Source:Harry Potter Fanfiction Metadata Archive
Loading...
