Opendatabay APP

Ernesto Castro Digital Legacy Dataset

E-commerce & Online Transactions

Tags and Keywords

Business, NLP, Text, Mining, Culture and Humanities, Generation, Video

Free

About

This dataset preserves the digital legacy of Ernesto Castro, an influential Spanish philosopher and writer, following his announced retirement and the potential deletion of his YouTube channel. It contains over a decade of content focused on philosophy, art history, and Warhammer 40,000. The dataset makes hundreds of hours of his lectures, dialogues, conferences, and other video content accessible, safeguarding his contributions to Spanish and Ibero-American thought for future generations.

Columns

  • ID_Video: The unique YouTube identifier for each video.
  • Title: The title of the YouTube video.
  • Description: The textual description provided for the video.
  • Channel: The name of the YouTube channel where the video was uploaded.
  • UploadDate: The date when the video was uploaded to YouTube.
  • URL: The direct link to the YouTube video.
  • Length: The duration of the video, measured in seconds.
  • Views: The total number of views the video has accumulated.
  • Likes: The number of positive reactions (likes) the video has received.
  • Dislikes: The number of negative reactions (dislikes) the video has received.
  • Comments: The content of comments left on the video.
  • QComments: A distinct category for comments, potentially signifying queried comments.
  • Transcription: A textual record of the spoken content within the video.
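As a minimal sketch of how the columns above might be read and checked, the following uses only the Python standard library. The one-row sample is synthetic placeholder data, and the ISO date format shown in it is an assumption, since the listing does not state how UploadDate is formatted.

```python
import csv
import io

# Column names exactly as listed in the Columns section.
EXPECTED_COLUMNS = [
    "ID_Video", "Title", "Description", "Channel", "UploadDate", "URL",
    "Length", "Views", "Likes", "Dislikes", "Comments", "QComments",
    "Transcription",
]


def read_rows(handle):
    """Read CSV rows as dicts and fail early if a listed column is missing."""
    reader = csv.DictReader(handle)
    present = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in present]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


# Synthetic one-row sample (placeholder values, not taken from the dataset).
SAMPLE = io.StringIO(
    ",".join(EXPECTED_COLUMNS) + "\n"
    "abc123,Example title,Example description,Example channel,2013-04-23,"
    "https://example.invalid/abc123,3600,1000,50,0,Example comment,1,"
    "Example transcript\n"
)

rows = read_rows(SAMPLE)
print(rows[0]["Title"], int(rows[0]["Length"]))
```

For the downloaded file, the same function could be called on `open(path, newline="", encoding="utf-8")`, where `path` is wherever the CSV was saved; the UTF-8 encoding is also an assumption.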

Distribution

The dataset is typically provided as a CSV file and covers 544 distinct videos. Video lengths range from 23 to over 36,000 seconds, with most falling between 3,623 and 7,223 seconds. Views range from 595 to over 521,000, with most videos between 595 and 52,666. Likes generally fall between 0 and 1,044, while dislikes are typically 0, with a few outliers up to 4,228. Video IDs, titles, and URLs each have 545 unique values, whereas descriptions and channels each have 544.
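Because the Length column is measured in seconds, the figures above are easier to read once converted to hours, minutes, and seconds. The helper below is an illustrative sketch (`format_duration` is a hypothetical name, not part of the dataset).

```python
def format_duration(seconds):
    """Convert a duration in seconds to an h:mm:ss string."""
    hours, remainder = divmod(int(seconds), 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours}:{minutes:02d}:{secs:02d}"


# Length values quoted in the Distribution section.
for length in (23, 3623, 7223, 36000):
    print(length, "->", format_duration(length))
```

Read this way, most videos run between roughly one and two hours, and the longest run to about ten hours.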

Usage

This dataset is ideal for a variety of applications and research purposes:
  • Data Preprocessing: Useful for cleaning data, removing special characters, and tokenising text from video descriptions and transcriptions.
  • Text Mining: Enables keyword analysis and topic detection within the video descriptions and full transcriptions.
  • Natural Language Processing (NLP): Facilitates text modelling, sentiment analysis, summary generation, and content classification of the video discussions.
  • Machine Learning: Supports the prediction of audience engagement based on metrics such as views, likes, and dislikes. It can also be used for classifying videos based on their descriptions and transcriptions.
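The preprocessing and keyword-analysis uses above can be sketched with the standard library alone. The stopword set here is a tiny illustrative assumption, and the input strings are made-up examples rather than real descriptions or transcriptions; a real analysis would likely use a fuller Spanish stopword list.

```python
import re
from collections import Counter

# Tiny illustrative stopword set (an assumption, far from complete).
STOPWORDS = {"de", "la", "el", "y", "en", "que", "a", "los", "las", "un", "una"}


def tokenise(text):
    """Lowercase the text and keep runs of letters, dropping digits and punctuation."""
    return re.findall(r"[^\W\d_]+", text.lower())


def top_keywords(texts, n=5):
    """Count non-stopword tokens across all texts and return the n most common."""
    counts = Counter(
        token
        for text in texts
        for token in tokenise(text)
        if token not in STOPWORDS
    )
    return counts.most_common(n)


# Made-up example strings standing in for Description or Transcription values.
examples = ["La filosofía y el arte", "Filosofía de la historia"]
print(top_keywords(examples, n=1))
```

The regular expression keeps accented letters such as "í", which matters for Spanish text.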

Coverage

The dataset's geographic scope is global, with a particular emphasis on Spanish and Ibero-American thought, reflecting Ernesto Castro's background and influence. The data spans 23 April 2013 to 6 January 2025, covering over a decade of content creation. The upload dates show consistent output throughout this period, with multiple videos uploaded each year. The content reaches millions of viewers, highlighting its broad appeal.
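One way to check the temporal coverage is to count uploads per year from the UploadDate column. This sketch assumes the dates are stored as ISO strings (YYYY-MM-DD), which the listing does not confirm; the two example values are simply the start and end dates quoted above.

```python
from collections import Counter
from datetime import date


def uploads_per_year(upload_dates):
    """Return a {year: count} dict, sorted by year, from ISO date strings."""
    years = Counter(date.fromisoformat(value).year for value in upload_dates)
    return dict(sorted(years.items()))


# The first and last dates stated in the Coverage section, in assumed ISO form.
print(uploads_per_year(["2013-04-23", "2025-01-06"]))
```

If the file uses a different date format, `datetime.strptime` with the matching format string would replace `date.fromisoformat`.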

License

CC0

Who Can Use It

This dataset is intended for a diverse group of users, including:
  • Data Scientists and Analysts: For exploring trends in online content engagement and performing quantitative analysis.
  • Natural Language Processing Researchers: To develop and test new algorithms for text analysis, sentiment recognition, and content summarisation.
  • Machine Learning Engineers: For training models to predict audience interaction or classify video content.
  • Academic Researchers: Those studying contemporary philosophy, art history, digital humanities, or the impact of online media on cultural discourse.
  • Social Media Analysts: To understand content performance and audience behaviour on platforms like YouTube.
  • Students and Educators: As a resource for learning about philosophy, art, and digital content creation.

Dataset Name Suggestions

  • Ernesto Castro YouTube Archive
  • Spanish Philosopher Video Transcripts and Metadata
  • Ernesto Castro Digital Legacy Dataset
  • Ernesto Castro (Transcripts, Comments) | 544 Episode Collection
  • Philosophical Video Content Analysis Dataset

Attributes

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 27/06/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
