Opendatabay APP

The Social World of NFTs Dataset

Social Media and Networking

Tags and Keywords

Currencies

And

Foreign

Exchange

Nlp

Art

Trusted By
Trusted by company1Trusted by company2Trusted by company3
The Social World of NFTs Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset captures the social discourse surrounding Non-Fungible Tokens (NFTs) on Reddit. It includes posts and comments that mention the term 'NFT' in their body or title, offering valuable insights into public sentiment and discussions around this emerging technology. The dataset serves as a resource for understanding the popularity of NFTs, the controversies associated with them, and how public perception and interest have evolved over time. It is particularly useful for analysing the language used by Reddit users to understand their attitudes and beliefs concerning NFT technology.

Columns

  • type: Denotes whether the entry is a post or a comment (String).
  • id: A unique identifier for the post or comment (String).
  • subreddit.id: A unique identifier for the subreddit (String).
  • subreddit.name: The name of the subreddit where the content was published (String).
  • subreddit.nsfw: Indicates if the subreddit is marked as Not Safe For Work (Boolean).
  • created_utc: The Unix timestamp when the post or comment was created (Integer).
  • permalink: The permanent link to the post or comment on Reddit (String).
  • body: The main text content of the post or comment (String).
  • sentiment: The calculated sentiment (e.g., positive, negative, neutral) of the post or comment (String).
  • score: The score (upvotes minus downvotes) of the post or comment (Integer).

Distribution

The dataset is typically provided in a CSV file format. It contains all posts and comments that mention the term 'NFT' in their title or body text from Reddit. While an exact total record count for the full dataset is not specified, sample insights indicate diverse unique values across various columns. For instance, the subreddit.nsfw column primarily shows false values, indicating non-NSFW content.

Usage

This dataset is ideal for:
  • Studying the phenomenon of non-fungible tokens and their impact on online communities.
  • Analysing the language used in NFT-related discussions to gauge public attitudes and beliefs.
  • Identifying popular topics and trends among Reddit users discussing NFTs by leveraging the 'score' column.
  • Tracking how interest in NFTs has changed over time using the 'created_utc' timestamp.
  • Developing and testing Natural Language Processing (NLP) models for sentiment analysis on social media data.

Coverage

The dataset has global coverage, reflecting discussions from Reddit users worldwide. The time range of the data spans approximately April 2021 to April 2022, based on the 'created_utc' timestamps provided. To ensure user privacy and prevent targeted harassment, the dataset does not include usernames.

License

CC0

Who Can Use It

  • Researchers focusing on blockchain technology, cryptocurrency trends, social media dynamics, and public perception of new technologies.
  • Data analysts interested in understanding online discussions, identifying emerging trends, and performing sentiment analysis related to digital assets.
  • Academics studying online communities, discourse analysis, and the societal impact of technological innovations.
  • Developers building applications that require insights into public opinion or a large corpus of social media text for training AI models.

Dataset Name Suggestions

  • Reddit NFT Discourse Dataset
  • NFT Social Media Sentiment
  • Reddit NFT Comment and Post Data
  • The Social World of NFTs Dataset

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

27/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format