Italian Viral Fake News Dataset
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset captures the social media discourse surrounding the "Gubbio raw fish incident", a viral Italian fake news story that captivated national media attention in October 2022. It details the spread of a humorous, yet exaggerated, account of mass dysentery at a raw fish lunch in Gubbio, Umbria. The incident generated a significant surge in social media activity, notably over 12,000 tweets on a single day. The data offers insights into the rapid virality of misinformation and public reaction to absurd events, including content that may contain strong Italian language. The story, while largely debunked, reflects a moment of shared national laughter, coinciding ironically with a significant political event.
Columns
- ID: A unique identifier for each social media post.
- author_id: An identifier for the author of the social media post.
- author_name: The full name of the author who created the post.
- author_username: The username of the author.
- created_at: The date and time when the social media post was published.
- edit_history_tweet_ids: A record of any previous versions or edits made to the tweet.
- public_metrics: This field contains various engagement metrics for the post, including:
- retweet_count: The total number of times the post was retweeted.
- reply_count: The total number of replies the post received.
- like_count: The total number of likes on the post.
- quote_count: The total number of times the post was quoted.
- text: The actual content or body of the social media post.
Distribution
The dataset is typically provided in a CSV file format, suitable for tabular analysis. It contains approximately 19,000 individual records, representing social media posts. Each record is structured with distinct columns, as detailed above, to provide a clear and organised view of the social media content and its associated metrics.
Usage
This dataset is ideal for:
- Analysing the spread and virality of fake news and misinformation on social media platforms.
- Studying social media trends, public sentiment, and collective reactions to unique or humorous cultural events.
- Conducting Natural Language Processing (NLP) research, particularly on text data in the Italian language, which may include informal or NSFW content.
- Investigating the impact of viral memes and online humour on public discourse.
- Examining media consumption patterns and how stories evolve from initial reports to widespread social phenomena.
Coverage
The dataset primarily focuses on social media activity from 19th October 2022 to 25th October 2022, capturing the peak period of the "Gubbio incident's" virality. Its geographic scope is centred on Italy, specifically reflecting discourse related to an event in Gubbio, Umbria, and engagement from the Italian-speaking social media audience. The data contains content in the Italian language, including potentially NSFW material, which should be considered by users.
License
CC0
Who Can Use It
This dataset is suitable for:
- Academic Researchers interested in social media analytics, linguistics, and the study of misinformation.
- Data Scientists and Analysts developing models for sentiment analysis, trend prediction, or content classification.
- Media and Communication Professionals tracking news cycles and the public response to various events.
- Cultural Studies Scholars exploring the role of humour, memes, and viral content in contemporary society.
Dataset Name Suggestions
- Gubbio Incident Social Media Discourse
- Italian Viral Fake News Dataset
- Gubbio Raw Fish Tweets
- Social Media Reaction to Gubbio Dysentery Hoax
- Viral Italian Humour Data 2022
Attributes
Original Data Source: Fake news: the Gubbio raw fish incident