The Social World of NFTs Dataset
Social Media and Networking
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset captures the social discourse surrounding Non-Fungible Tokens (NFTs) on Reddit. It includes posts and comments that mention the term 'NFT' in their body or title, offering valuable insights into public sentiment and discussions around this emerging technology. The dataset serves as a resource for understanding the popularity of NFTs, the controversies associated with them, and how public perception and interest have evolved over time. It is particularly useful for analysing the language used by Reddit users to understand their attitudes and beliefs concerning NFT technology.
Columns
- type: Denotes whether the entry is a post or a comment (String).
- id: A unique identifier for the post or comment (String).
- subreddit.id: A unique identifier for the subreddit (String).
- subreddit.name: The name of the subreddit where the content was published (String).
- subreddit.nsfw: Indicates if the subreddit is marked as Not Safe For Work (Boolean).
- created_utc: The Unix timestamp when the post or comment was created (Integer).
- permalink: The permanent link to the post or comment on Reddit (String).
- body: The main text content of the post or comment (String).
- sentiment: The calculated sentiment (e.g., positive, negative, neutral) of the post or comment (String).
- score: The score (upvotes minus downvotes) of the post or comment (Integer).
Distribution
The dataset is typically provided in a CSV file format. It contains all posts and comments that mention the term 'NFT' in their title or body text from Reddit. While an exact total record count for the full dataset is not specified, sample insights indicate diverse unique values across various columns. For instance, the
subreddit.nsfw
column primarily shows false
values, indicating non-NSFW content.Usage
This dataset is ideal for:
- Studying the phenomenon of non-fungible tokens and their impact on online communities.
- Analysing the language used in NFT-related discussions to gauge public attitudes and beliefs.
- Identifying popular topics and trends among Reddit users discussing NFTs by leveraging the 'score' column.
- Tracking how interest in NFTs has changed over time using the 'created_utc' timestamp.
- Developing and testing Natural Language Processing (NLP) models for sentiment analysis on social media data.
Coverage
The dataset has global coverage, reflecting discussions from Reddit users worldwide. The time range of the data spans approximately April 2021 to April 2022, based on the 'created_utc' timestamps provided. To ensure user privacy and prevent targeted harassment, the dataset does not include usernames.
License
CC0
Who Can Use It
- Researchers focusing on blockchain technology, cryptocurrency trends, social media dynamics, and public perception of new technologies.
- Data analysts interested in understanding online discussions, identifying emerging trends, and performing sentiment analysis related to digital assets.
- Academics studying online communities, discourse analysis, and the societal impact of technological innovations.
- Developers building applications that require insights into public opinion or a large corpus of social media text for training AI models.
Dataset Name Suggestions
- Reddit NFT Discourse Dataset
- NFT Social Media Sentiment
- Reddit NFT Comment and Post Data
- The Social World of NFTs Dataset
Attributes
Original Data Source:The Social World of NFTs: A Reddit Dataset