Ernesto Castro Digital Legacy Dataset
E-commerce & Online Transactions
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset preserves the digital legacy of Ernesto Castro, an influential Spanish philosopher and writer, following his announced retirement and the potential deletion of his YouTube channel. It contains over a decade of content focused on philosophy, art history, and Warhammer 40,000. The dataset makes hundreds of hours of his lectures, dialogues, conferences, and other video content accessible, safeguarding his contributions to Spanish and Ibero-American thought for future generations.
Columns
- ID_Video: The unique YouTube identifier for each video.
- Title: The title of the YouTube video.
- Description: The textual description provided for the video.
- Channel: The name of the YouTube channel where the video was uploaded.
- UploadDate: The date when the video was uploaded to YouTube.
- URL: The direct link to the YouTube video.
- Length: The duration of the video, measured in seconds.
- Views: The total number of views the video has accumulated.
- Likes: The number of positive reactions (likes) the video has received.
- Dislikes: The number of negative reactions (dislikes) the video has received.
- Comments: The content of comments left on the video.
- QComments: A distinct category for comments, potentially signifying queried comments.
- Transcription: A textual record of the spoken content within the video.
Distribution
The dataset is typically provided in a CSV file format and includes data from 544 distinct episodes or videos. It covers a wide range of values for various metrics. For instance, video lengths vary from 23 to over 36,000 seconds, with most falling between 3,623 and 7,223 seconds. Views range from 595 to over 521,000, predominantly between 595 and 52,666. Likes are generally between 0 and 1,044, while dislikes are typically 0, with a few outliers up to 4,228. The number of unique values for video IDs, titles, and URLs is 545, whereas for descriptions and channels it is 544.
Usage
This dataset is ideal for a variety of applications and research purposes:
- Data Preprocessing: Useful for cleaning data, removing special characters, and tokenising text from video descriptions and transcriptions.
- Text Mining: Enables keyword analysis and topic detection within the video descriptions and full transcriptions.
- Natural Language Processing (NLP): Facilitates text modelling, sentiment analysis, summary generation, and content classification of the video discussions.
- Machine Learning: Supports the prediction of audience engagement based on metrics such as views, likes, and dislikes. It can also be used for classifying videos based on their descriptions and transcriptions.
Coverage
The dataset's geographic scope is global, with a particular emphasis on Spanish and Ibero-American thought, reflecting Ernesto Castro's background and influence. The data spans a significant period, beginning on 23rd April 2013 and concluding on 6th January 2025, encompassing over a decade of content creation. The upload dates show a consistent output throughout these years, with multiple videos uploaded annually across various intervals. The content reaches millions of viewers, highlighting its broad appeal.
License
CC0
Who Can Use It
This dataset is intended for a diverse group of users, including:
- Data Scientists and Analysts: For exploring trends in online content engagement and performing quantitative analysis.
- Natural Language Processing Researchers: To develop and test new algorithms for text analysis, sentiment recognition, and content summarisation.
- Machine Learning Engineers: For training models to predict audience interaction or classify video content.
- Academic Researchers: Those studying contemporary philosophy, art history, digital humanities, or the impact of online media on cultural discourse.
- Social Media Analysts: To understand content performance and audience behaviour on platforms like YouTube.
- Students and Educators: As a resource for learning about philosophy, art, and digital content creation.
Dataset Name Suggestions
- Ernesto Castro YouTube Archive
- Spanish Philosopher Video Transcripts and Metadata
- Ernesto Castro Digital Legacy Dataset
- Ernesto Castro (Transcripts, Comments) | 544 Episode Collection
- Philosophical Video Content Analysis Dataset
Attributes
Original Data Source: Ernesto Castro (Transcripts, Comments) | 544 Ep